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(57) Abstract 

The invention consists of: (1) cloned Sorangium cellulosum polyketide synthase (PKS) biosynthetic cluster DNA; and (2) the 
nucleotide sequence and predicted protein coding sequences of the cloned DNA. The invention can be used for, but not limited to: 
(a) increasing yields of PKS product in Sorangium cellulosum (e.g., by amplification or genetic modification of the epothilon 
gene cluster or its component parts); (b) increasing yields of polyketide product in a heterologous system by transfer of the 
epothilone gene cluster or its component parts, which may be followed by amplification or genetic modification of the PKS gene 
cluster or its component parts; (c) modification of the polyketide product chemical structure in either Sorangium cellulosum or a 
heterologous host (e.g., by genetic modification of the epothilone gene cluster or its component parts; and (d) for the detection 
of genes and gene products involved in making polyketides or related molecules in other organisms (e.g., by hybridization or 
complementation assays). DNA sequence and analysis is presented for the following cosmids and plasmids: A2 cosmid; the 
pEPOcos6 region (overlapping of pEPOcos6 and pEPOcos7); pEPOcos8 cosmid; AS cosmid; Sau4 (10 kb plasmid). 

(57) Abrege 

L'invention concerne: (1) un ADN biosynthetique clone en grappe de polyketide synthase (PKS) de Sorangium cellulosum; et (2) 
la sequence nucleotidique et les sequences proteiques codantes prevues de i'ADN clone. L'invention peut avoir les applications 
suivantes (sans caractere limitatif): (a) augmentation de la production de PKS chez Sorangium cellulosum (p.ex., par 
("amplification ou la modification genetique de la grappe de genes epothilone ou de ces parties constitutives); (b) augmentation 
de la production du produit polyketide dans un systeme heterologue par le transfert de la grappe de genes epothilone ou de ces 
parties constitutives, qui peut etre suivie par ('amplification ou la modification genetique de la grappe de genes PKS ou des ses 
parties constitutives; (c) modification de la structure chimique du produit polyketide sort chez Sorangium cellulosum soit chez 
un note heterologue (p.ex., par ('amplification ou la modification genetique de la grappe de genes epothilone ou de ces parties 
constitutives); et (d) detection de genes et de produits geniques participant a la fabrication de polyketides ou de molecules 
correspondantes dans d'autres organismes (p.ex., par des dosages & hybridation ou £ complementation). La sequence d'ADN et 
I'anatyse sont presentees pour les cosmides et les plasmides suivants: cosmide A2; region pEPOcos6 (se chevauchant avec 
pEPOcos6 et pEPOcos7); cosmide pEPOcos8; cosmide A5; Sau4 (plasmide 10 kb). 
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DNA sequences for enzymatic synthesis of polyketide or 

heteropolyketide compounds 

The present invention relates to DMA sequences for enzy- 
matic synthesis of polyketide or heteropolyketide compounds 
produced by the bacterium Sorangium cellulosum. 

Background and introduction 



This patent application describes DNA sequences for the 
enzymatic synthesis of polyketide and/or heteropolyketide 
structures synthesized by the myxobacterium Sorangium cellulo- 
sum. Several of these compounds have known cytotoxic, immuno- 
suppressive, antibiotic and fungicidal biological activity, 
with the epothilones having been most studied and character- 
ized. The fermentation of large quantities of secondary me- 
tabolites from microorganisms, especially from myxobacteria, is 
a time consuming and difficult process that often involves com- 
plications (e.g. contamination, low product yield, difficult 
isolation and purification) . Therefore it would be advanta- 
geous to use a well -characterized organism for such fermenta- 
tions. After cloning of the desired biosynthetic genes one 
could create such an organism via genetic engineering and ma- 
nipulate the biosynthesis of the compound. Identified sequences 
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can be cloned into optimized expression vectors and generate 
recombinant cell lines that overproduce polyketide structures. 

Polyketide synthases (PKS) and non-ribosomal peptide syn- 
thetases (NRPS) represent macromolecular and multifunctional 
enzymes which are characterized by a modular architecture. PKS 
condenses activated carbonic acids (usually acetate and propi- 
onate) and reduce the resulting 2-keto acid intermediates step- 
wise in a fatty acid biosynthesis -like fashion. Responsible for 
each reaction step is a specific domain that recognizes, acti- 
vates, condenses and reduces the carbonic acid. Depending on 
the presence of these domains in the corresponding modules, 
every reduction stage can occur in the final product (Rawlings, 
Nat. Prod. Reports 14, 523-556 [1997]; for a review, see Chem. 
Rev. 12, 2463-2760 [1997]). a typical example for the biosyn- 
thesis of a polyketide is the macrolide antibiotic erythromycin 
(Staunton and Wilkinson, Chem. Rev. 97, 2611-2630 [1997] ) 
NRPSs are also modular enzymes and condense via peptide bonds 
amino acids to low molecular weight bioactive substances like 
bacitracin or tyrocidin. Typical domains of these systems acti- 
vate the amino acid and condense it with the growing peptide 
chain. Methylations, epimerisations and modifications via addi- 
tional protein domains are possible (Stachelhaus and Marahiel, 
FEMS Microbiol Lett. JUS., 3-14 [1995]). Both types of enzymes 
(NRPS and PKS) share the modular organization of the proteins 
in which specific catalytic domains are responsible for recog- 
nition, activation, condensation and modification of the single 
elongation units. The growing chain of amino acids and/or car- 
bonic acids is extended through the action of one module adding 
one unit. The domains of each module carry the active centers 
responsible for the enzymatic steps of the biosynthesis. 
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Little is known about the biosynthesis of biologically ac- 
tive polyketides and polypeptides from myxobacteria . Fragments 
of the biosynthetic gene clusters of soraphen and saframycin 
have been described from Sorangium cellulosum So ce26 and Myxo- 
ccccus xanthus, respectively (Schupp et al . . J. Bacterid. 177 
3673-3679 [1995] and Pospiech et al . . Microbiology !41, 1793- 
1803 [1995]). We have constructed genomic libraries of the 
epothilone producer Sorangium cellulosum So ce90. Gene probes 

based on PKS and PS genes were uspri t-« Ho^i=^» 

3 were usea co isolate recombinant cos- 

mids, which were then sequenced and characterized. Several 
unique pathways containing PKS, PS, or a combination of both 
types of genes were identified, demonstrating that this organ- 
ism is potentially a rich source of novel bioactive compounds. 

A subject of the present invention is therefore to provide 
DNA sequences according to claim 1 the expression products of 
which perform or are involved in the en 2 ymatic biosynthesis, 
mutasynthesis or partial synthesis of polyketide or hetero- 
polyketide compounds. The DNA sequences may be inserted into 
well known and optimized expression vectors by commmon tech- 
niques of molecular biology, thus allowing transformation, se- 
lection and cloning of cells, which cells are then capable of 
synthezising polyketide or heteropolyketide compounds by fer- 
mentation. Using an overproducing clone allows the desired 
polyketide or heteropolyketide compounds be easily produced and 
recovered in high amounts. Further, knowledge of the localiza- 
tion of regulatory DNA segments and individual structural genes 
allows "site-directed mutagenesis- using common techniques for 
genetic engineering, and thus construction of optimized enzymes 
("protein engineering") for fermentative synthesis of polyketi- 
de or heteropolyketide compounds. 



WO 00/22139 



4 



PCT7US99/23535 



The invention thus further relates to a recombinant ex- 
pression vector according to claim 16, cells transformed there- 
with according to claim 17 and to a process for enzymatic bio- 
synthesis, mutasynthesis or partial synthesis of polyketide or 
heteropolyketide compounds according to claim 23. 

Preferred and/or advantageous embodiments of the present 
invention are subject-matter of the subclaims. 

In brief, the invention consists of (1) cloned Sorangiim 
cellulose polyketide synthase (PKS) and/or peptide synthetase 
(PS) biosynthetic cluster DNA and (2) the nucleotide sequence 
and predicted protein coding sequences of the cloned DNA. The 
invention can be used for, but not limited to, (a) increasing 
yields of PKS product in Sorangiurn cellulosum (e.g., by ampli- 
fication or genetic modification of the epothilone gene cluster 
or its component parts}, (b) increasing yields of polyketide 
and/or peptide synthetase product in a heterologous system by 
transfer of the corresponding gene cluster or its component 
parts, which may be followed by amplification or genetic modi- 
fication of the PKS and/or PS gene cluster or its component 
parts, (c> modification of the polyketide and/or peptide syn- 
thetase product chemical structure in either Sorangiurn cellulo- 
sun, or a heterologous host (e.g., by genetic modification of 
the corresponding gene cluster or its component parts) and (d) 
for the detection of genes and gene products involved in making 
polyketides or related molecules in other organisms (e.g., by 
hybridization or complementation assays) . DNA sequence and 
analysis is presented for the following cosmids and plasmids: 
A2 cosmid as defined in claim 6 

the pEPOcose region (overlapping of pEPOcose and pEPOcos?) 
as defined in claim 7 
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pEPOcos8 cosmid as defined in claim 10 

A5 cosmid as defined in claim 12 

Sau4 (10 kb plasmid) as defined in claim 14 

The invention is now described in more detail by examples and 
for illustration only. The examples are not to be construed as 
any limitation of the scope. 

Figure 1 is a restriction map of one of the DNA sequences of 
the present invention (cosmid A2 insert) indicating also the 
localization of regulatory DNA segments and the individual 
structural genes {"open reading frames" or ORFs) 1 to 16. 

Figure 2 shows the open reading frames found on pEPOcos6 region 

DNA sequence data from A2 cosmid are as defined in claim 6. 

Table 1 correlates ORFs 1 to 16 found on A2 cosmid with the re- 
spective biological function (Regulators, Enzymes) . 
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gene /function 


1 

position 




ID 


ORF 1 


regulatory element 


1666 - 


1 




ORF 2 


regulatory element 


1605 - 


3338 






ORF 3 


acyl-t-RNA synthetase 


6100 - 


3398 


* 


20 


ORF 4 


monooxygenase 


7110 - 


6374 






ORF 5 


amino transferase 


9590 - 


8433 




25 


ORF 6 


L-dopa decarboxylase 


11393 - 


9855 






ORF 7 


cxidoreductase 


13656 - 


12712 






ORF 8 


polyketice synthase 


15374 - 


18984 




30 


ORF 9 


polypeptide synthetase 


20003 - 


27889 






ORF 10 


peptidase j 


28251 - 


29402 




35 


ORF 11 


regulatory element 


31720 - 


30401 






ORF 12 


sigma factor 


31982 - 


32932 






ORF 13 


regulatory element 


33128 - 


33613 




40 


ORF 14 


regulatory element 


33661 - 


34007 






ORF 15 


transcription regulator 


35611 - 


35255 




45 


ORF 16 


signal transduction 


37856 - 


35730 
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A. Construction of a Sorangium celluloaum cosmid library 



5 1. Isolation of genomic DNA from 3. cellul 



os urn So ce90 



a. Sorangium ceilulosun. So ce90 was spread onto solid CA-2 
agar and incubated at 30=C for 5-7 days. CA-2 agar is prepared 
by autoclaving is g Bacto-agar (Difco Laboratories, Detroit, 
MI) in 800 ml dH 2 0 for 20 min at 121°C and cooling to 50-55°C in 
a water bath. The following filter-sterilized solutions are 
added to the agar: 20% (w/v) glucose, 50 ml; Solution A (7.5% 
[w/v] KN0 3 ,7.5% K 2 HPO«), 10 ml; So l ution B {1-5% [w/v] 
MgS0 4 '7H 2 0), 10 ml; Solution C (0.2% [w/v] CaCl 2 2H 2 0, 0 . 15% [w/v] 
Feci,), 10 ml; 1 M HC1, 1 ml; autoclaved 4-day old Sorangium 
cellulosum broth, 100 ml. A sample of cells was removed from 
the plates with a sterile loop and inoculated into 50 ml of 
G51t medium in a 250 ml Erlenmeyer flask. G51t consists of 0.5% 
starch (Cerestar), 0.2% tryptone, 0.1% yeast extract, 0.05% 
Cad;, 0.05% MgSO 4 -7H,0, 1.2% 4- (2 -hydroxyethyl ) -l-piperazine- 
ethanesulfonic acid (HEPSS) , 0.2% glucose, pH 7.6. The flasks 
were shaken at 3 0°C. 160 rpm until a dense orange bacterial 
growth was obtained (ca. 5-7 d.). The cells were pelleted by 
centrifugation at 6,000 x g and used immediately or stored fro- 
25 zen at -20°C. 

The protocol used for isolating chromosomal DNA from bac- 
teria using hexadecyltrirriethylammmonium bromide (CTAB) has been 
described previously (Ausubel et al . , Current Protocols in Mo- 
lecular Biology, John Wiley and Sons, New York, 1990) . The pre- 
cipitated DNA was recovered with a bent Pasteur pipette, washed 
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with 70% and 95% ethanol, air-dried, and resuspended in 0 . 5 ml 
TE buffer (0.01 M Tris-HCl, 0.001 M ethylenediaminotetraacetic 
acid [EDTA] , pH 8.0) . 

b. Alternatively, genomic DNA was isolated from S. celluloaum 
cells cultured as described in section A.l using the Midi 
Qiagen Blood & Cell Culture DNA purification Kit (Qiagen, 
Hilden. Germany) following the Qiagen Genomic DNA Handbook pro- 
tocol for bacterial DNA isolation (1997, Qiagen, Hilden, Ger- 
many, p. 29 ff.). m order to obtain high molecular weight 
chromosomal DNA the precipitated DNA was recovered with a bent 
pasteur pipette as described in section A.l. 



2 . Isolation of plasmid DNA 

a. P FD666: pFD666 is a Afunctional E. coli- Strep Comyces cosmid 
cloning vector (see Denis and Brzezinski, Gene 111, 115-118 

[1992]). To maintain stability of large inserts, it is present 
in low-medium copy number when replicated in E. coll. For this 
reason, isolation of sufficient pure DNA to carry out cloning 
experiments was difficult using commercial kits with standard 
protocols. A modified procedure was therefore used to obtain 
PFD666 DNA. A 10 ml culture of DH10B (pFDS66) was grown for 16- 
20 hr at 37°C in LB (1% tryptone, 0.5% yeast extract, 0.5% 
NaCi, pH 7.0) medium containing 50 M g/ml kanamycin sulfate. 
Fifcy ml 0 f LB + kanamycin was inoculated to a starting OD 600 of 
ca. 0.25 and shaken at 300 rpm, 37°C, until the 6d 60 o reached 
ca. 0.6. Five hundred tr.l of LB + kanamycin medium in a 2 1 
flask was inoculated with 25 ml of this culture and incubated 
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under Che same conditions for 2.5 hr. Chloramphenicol ( 2.5 ml 
of a 34 mg/ml solution in 100% EtOH) was added and the incuba- 
tion continued for an additional 16-20 hr. (The previous seeps 
were performed according to Maniatis et al . Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, 1989.) Cells were pelleted for 10 min, 16,000 x g . 
They were resuspended in 9 ml of 50 mM glucose/25 mM Tris-HCl 
(pH 8.0) /10 mM EDTA and transferred to a 50 ml disposable cen- 
trifuge tube. One ml of a freshly-prepared 10 mg/ml lysozyme 
solution in 10 mM Tris-HCl, pH 8 . 0 was added and the cell sus- 
pension incubated in a 37°c water bath for 10 min. Twenty ml of 
a freshly-prepared 0.2 NaOH/1% sodium dodecyl sulfate (SDS) so- 
lution was added and the tube inverted gently 5-7 times to mix 
the contents. After 5 min at room temperature, 15 ml of 5 M po- 
tassium actate (pH 4.8) was added and the tube inverted sharply 
3-4 times. The tube was centrifuged at 6,000 x g for 10 min at 
4°C and the supernatant poured though 2 layers of sterile 
cheese cloth into a fresh 50 ml disposable tube. Isopropanol to 
a final concentration of 0.6% was added and the contents of the 
tube mixed several times. The precipitated nucleic acid was 
centrifuged at 6,000 x g for 10 min at 4°C. The pellet was 
washed with 70% EtOH and any excess EtOH was aspirated from the 
pellet, which was allowed to air dry for 5 min. It was resus- 
pended in 5 ml of 50 mM 3- (N-Morpholino) propanesulf onic acid 
(MOPS)/750 mM NaCl, P H 7.0 and added to an equilibrated to 
QlAf liter Midi column (Qiagen, Chatsworth, CA) . The manufac- 
turer's protocol for washing and eluting the plasmid DNA was 
followed. 
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b. SuperCos: SuperCos plasmid DNA was purchased from Strata 
gene (La Jolla, CA) . 

3. Preparation of ca . 3 8-47 kb Sau3Al fragments of S. cellulo- 
5 sum chromosomal DNA 



a. S. cellulosum chromosomal DNA prepared as described in sec- 
tion A.l.a was partially cleaved with restriction endonuclease 
Sau3Al in a 1000 pi reaction volume consisting of 50 /xg chromo- 
somal DNA, 5 units enzyme (Promega, Madison , wi), 0.006 M 
Tris-HCl, 0.006 M MgCl 2 , 0.10 M NaCl , and 0.001 M dithiothrei- 
tol (pH 7.5) for 5 min at 37°C. The reaction mixture was ex- 
tracted once with an equal volume of 1:1 phenol : chloroform. 
After centrifugation, the upper aqueous phase was saved, to 
which 0.1 vol. of 3 M sodium acetate and 0.6 vol. isopropanol 
was added. DNA was pelleted by centrifugation for 5 min at 
16,000 x g in a microfuge and washed once with 0 . 5 ml 70% EtOH. 
After drying in a SpeedVac (Savant Instruments, Farmingdale, 
NY) for 5 min, the pellet was resuspended in 0.1 ml TE buffer. 
The DNA was layered ont:cp of a 12 ml 10-40% sucrose gradient 
prepared in TE buffer and centrifuged at 113,600 x g for 16 hr, 
10°C using a Beckman SW40Ti rotor (Beckman Instruments, Palo 
Alto, CA) . Five hundred ^1 aliquots of the gradient were re- 
moved using a pipetor beginning at the top of the tube. Samples 
(5 M l) of the fractions were analyzed by electrophoresis 
through a 0.5% agarose gel in TAE buffer (0.04 M Trizma base, 
0.02 M acetic acid, and 0.001 M ED'TA, pH 8.3) containing 0.5 
fig /ml ethidium bromide for 6 hr at 100 V. Fractions containing 
DNA fragments of ca. 40-45 kb were identified by comparison to 
a high molecular weight DNA standard (Life Technologies, 
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Gaithersburg, MDJ . Sucrose was diluted from the corresponding 
0.5 ml fraction by addition of 0.5 vol. TE. Subsequently, DNA 
was precipitated by addition of 0.1 vol. 3 M sodium acetate and 
0.6 vol. isopropanol . DNA was pelleted by centrifugation at 
16,000 x g for 10 min in a microf uge . DNA was washed with 0.5 
ml 70% EtOH and dried in a SpeedVac with moderate heat for 10 
min. Finally, the DNA was resuspended in distilled H 2 0 at a 
concentration of 0.5 mg/ml . 

b. Alternatively, 10 pig of S. cellulosum chromosomal DNA pre- 
pared as described in A.l.b was treated with 0.3 U Sau3Al (New 
England Biolabs, Beverly, MA) for 1 h at 37°C in 400 pi of the 
supplier's recommended reaction buffer. Formation of DNA frag- 
ments of about 4 0 kb in size was checked by comparison of the 
motility behavior with high molecular weight DNA standards af- 
ter a 0.3% agarose gel electrophoresis. An equal volume of phe- 
nol : chloroform (1:1) was added, mixed and centrifuged. The up- 
per aqueous phase was recovered and 0.1 vol. of 3 M sodium ace- 
tate and 0.6 vol. of isopropanol were added. After centrifuga- 
tion, the precipitated DNA was washed twice with 0 . 5 ml 70% ice 
cold ethanol and finally air-dried. The DNA fragments were re- 
suspended in 100 pjl shrimp alkaline phosphatase reaction buffer 
and dephosphorylated for 150 min. at 37°C using 2 U shrimp al- 
kaline phosphatase (Amersham Life Science, Cleveland, OH) . A 
phenol : chloroform extraction followed as described above. Fi- 
nally, the DNA was precipitated by addition of 0.1 vol. 3 M so- 
dium acetate and 0.6 vol. isopropanol, dried, and dissolved in 
TE buffer. 
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4. Preparation of cosmid libraries 

a. Using P PD666: Vector pFD666 was cleaved with restriction 
endonuclease BamHI in a 0.02 ml reaction volume consisting of 2 
M9 Plasmid DMA, 10 units of BamHI (Promega), 0.006 M Tris-HCl, 
0.006 M MgCl 2 , 0.05 M NaCl , and 0.001 M dithiothreitol ( P H 7.5)' 
for 90 min at 37°C. Five „l of lOx alkaline phosphatase buffer 
(0.5 M Tris-HCl [pH 9.3], 0.01 M MgCl,, 0.001 M ZnCl 2 . 0.01 M 
spermidine) was added to the reaction followed by alkaline 
phosphatase (0.01 units/pmol ends; Promega) and distilled H 2 Q 
to a final volume of 0.05 ml. The sample was incubated for 30 
min at 37°C and a second aliquot of phosphatase was added. Af- 
ter a further 30 min at 37°C. .0.3 ml of stop buffer (0.01 M 
Tris-HCl [ P H 7.5], 0.001 M EDTA, 0.2 M NaCl, 0.5% SDS) and 0.35 
ml of 1 = 1 phenol; CHC1 3 was added to the reaction. The sample 
was mixed gently several times by inversion and centrifuged at 
16,000 x g for 3 min to separate the phases. The aqueous layer 
was removed to a new microfuge tube. 0.1 vol. 3 M sodium ace- 
tate and 2 vol. 100% EtOH were added and the precipitated DNA 
pelleted by centrifugation at 16,000 x g for 10 min. Liquid was 
removed by aspiration and the pellet washed once with 0.5 ml 
70% EtOH. The DNA was dried in a SpeedVac and resuspended in TE 
buffer to 0.5 mg/ml . 

Digested, phosphatase- treated pFD666 was ligated to the 
partially-cleaved chromosomal DNA (see sections A. 3. a and 
B.l.a) in a 0.005 ml reaction consisting of 1 /ig pF D666, 1 M g 
S. cellalosum DNA, 0.03 M Tris-HCl ( P H 7.8), 0.01 M MgCl,, 0.01 
M dithiothreitol, and 0.0005 M adenosine- 5' -triphosphate and 
1.5 Weiss units of T4 DNA ligase (Promega). The reaction was 
carried out at room temperature for 2 hr. The entire reaction 
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mix was packaged into bacteriophage X in vitro using Packagene 
extracts (Promega) according to the manufacturer's directions. 
The entire packaging reaction (0.5 ml) was diluted with 4 . 5 ml 
SM buffer (per liter: 5.8 g NaCl, 2 g MgSO«.7H 2 0, 1 M Tris- 
HCl[pH 7.5], 5 ml 2% gelatin solution). Transfection was per- 
formed by adding 10 ml cf an overnight culture of E. coli DH5a 
that had been grown in LB medium with 0.01 M MgS0 4 and 0.2% 
maltose to the diluted phage and incubating at 37°c for 20 min. 
0.8 ml of LB was added and the cells shaken at 22 5 rpm for 1 hr 
at 37°C. Cells were pelleted, resuspended in LB, and spread 
onto a 150 mm LB + kanamycin agar plate. After 3 d. at 30°C, 
the colonies were harvested by picking ca. 800 colonies into 
2.0 ml LB + kanamycin medium containing 20% glycerol, freezing 
on dry ice, and storing at -70°C. In addition, six kanamycin- 
resistant colonies were inoculated into 2 ml LB + kanamycin 
liquid medium and incubated at 37°C, 250 rpm, for 18-24 hr. 
Cosmid DNA was prepared using a standard alkaline lysis proce- 
dure starting with 1.5 ml of the culture. DNA was digested with 
restriction endonuclease ?stl and samples electrophoresed on a 
0.8% TAE agarose gel for 1.5 hr at 100 V. A unique restriction 
pattern was noted in each sample and the total size of the in- 
sert was calculated to be between 40 and 45 kilobases. 

b. using SuperCos: 3 0 of vector SuperCcs was digested with 
Xbal (New England Biolabs, Beverly, MA) for 210 min at 37°C in 
100 pi of the recommenced reaction buffer. Ten nl sodium ace- 
tate and 60 Ml isopropar.ol was added before the solution was 
centrifuged for 30 min a. 16,000 x g. The precipitated DNA was 
washed twice with 500 ^- ice cold 70% ethanol . The vector DNA 
was precipitated and air-dried, dissolved in 135 jul shrimp al- 
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kaline phosphatase reaction buffer and treated with 2.5 U 
shrimp alkaline phosphatase for 150 min. After heat inactiva- 
tion of the enzyme at 75°C for 20 min, a phenol : chloroform ex- 
traction was performed as described in section l. c. The DNA, 
resuspended in 100 M l BamHI restriction buffer was hydrolyzed 
with 15 U BamHI (New England Biolabs, Beverly, MA) for 180 min. 
A phenol : chloroform extraction followed (see section A. 3). The 
SuperCos DNA was precipitated by additon of 0.1 vol 3 M sodium 
acetate and 0.6 vol isopropanol, centrifuged at 16,000 x q, and 

20 

10 resuspended in 50 ptl TE buffer. 

Four of digested vector DNA was ligated with 10 jxg par- 
tially hydrolyzed genomic DNA from S. cellulosum (as described 
in section A.3.b) in a final volume of 20 i±l using 2 U T4 DNA 
ligase and the appropriate reaction buffer (Gibco BRL, Eggen- 
15 stein, Germany) . The reaction was carried out at 16°C over- 
night. The reaction mixture was packaged into phage particles 
using the Gigapack III XL packaging extract kit (Stratagene) 
according to the manufacture's protocol. Treatment of packaging 
reaction mixture and transfection of £. coli SURE (Stratagene) 
was performed as described in 4. a. Transfected cells were con- 
centrated by centrifugation, resuspended in fresh LB medium and 
distributed on LB agar plates containing 50 /ig/ml^kanamycin. 
The plates were incubated overnight at 30°C. 1600 recombinant 
clones were transferred into 96 well microtiter plates filled 
25 with 80 m1 LB medium containing 50 MgMl'kanamycin per well and 
propagated overnight at 30°C. The following day the microtiter 
plates were used to inoculate a second set of microtiter plates 
in order to obtain a duplicate of the recombinant clones. Each 
well of the original sen of microtiter plates was supplemented 
with 80 pi 50 % glycerol and the entire plate stored at -70°C. 
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20 randomly chosen transf ormants were inoculated into 3 ml 
LB medium with 50 ^/ml' 1 kanamycin and incubated over night at 
37°C in order to isolate plasmid DNA using the Qiagen plasmid 
extraction kit (Qiagen, Hilden, Germany). Restriction fragment 
analysis of the recombinant cosmids using the restriction endo- 
nucleases PstI and Bglll indicated that the cosmids contained 
inserts of approximately 35 to 42 kb in size. 

B. Construction of a S. cellulosum plasmid library 

1. Preparation of 8-12 kb fragments of 5. cellulosum chromoso- 
mal DNA. 

5. cellulosum chromosomal DNA prepared as described in sec- 
tion A.l.a was partially cleaved with restriction endonuclease 
Sau3Al in a 100 uL reaction volume consisting of 5 ug chromoso- 
mal DNA, 5 units enzyme (Promega, Madison , WI), 0.006 M Tris- 
HC1, 0.006 M MgCl 2 , 0.10 M NaCl , and 0.001 M dithiothreitol (pH 
7.5) for 4 min at 3 7°C. The digested DNA was electrophoresed 
through a 11 x 14 cm 0.8% TAE-agarose gel for 18 hr at 17 V. 
Fragments of a -12 kb were cut from the gel and purified using 
the QIAquick Gel Extraction Kit using the manufacturer's proto- 
col (Qiagen) . 



25 2. Preparation of the plasmid library 

Plasmid P Zero2.1 (Invitrogen, Carlsbad, CA) was cleaved with 
restriction endonuclease BamHI in a 0.02 ml reaction volume 
consisting of 1 ug plasmid DNA, 10 units of BamHI C Promega ) , 
30 0.006 M Tris-HCl, 0.006 M MgCi 2 , 0.05 M NaCl , and 0.001 M di- 
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thiothreitol ( p h 7.5} for 20 min at 37°C. 0.08 ml of dH 2 0 and 
0.1 ml of i:i phenol :CHC13 was added. The sample was briefly 
vortexed and centrifuged at 16,000 x g for 2 min. The aqueous 
layer was removed to a new microfuge tube. 0.1 vol. 3 M sodium 
acetate and 2 vol. 100% EtOH were added and the precipitated 
DNA pelleted by centrif ligation at 16,000 x g for 10 min. Liquid 
was removed by aspiration and the pellet washed once with 0.5 
ml 70% EtOH. The DNA was dried in a SpeedVac and resuspended in 
TE buffer to 0.004 ng/ml . Digested P Zero2 . 1 was ligated to the 
partially-cleaved chromosomal DNA in a 0.01 ml reaction con- 
sisting of 0.004 ug P Zero2.1, 0.05 ra s. cellulosum DNA, 0.03 M' 
Tris-HCl <pH 7.8), 0.01 M MgCl 2 , 0.01 M dithiothrei tol , and 
0.0005 M adenosine-5' -triphosphate and 1.5 Weiss units of T4 
DNA ligase (Promega) . The reaction was carried out at room tern- 
15 perature for 2 hr. 0.015 ml dH 3 0 and 0.25 ml of' l-butanol were 
added, the sample vortexed briefly, and centrifuged at 16,000 x 
g for 10 min. Liquid was aspirated away from the pellet and the 
sample dried in a SpeedVac for 5 min. The ligated DNA was re- 
suspended in 0.005 ml dH 2 0 and mixed with 0.04 ml of electro- 
competent Escherichia coli DH10B cells {GIBCO/BRL, Gaithers- 
burg, MD) . The sample was placed into a pre-chilled 0.2 mm-gap 
electroporation cuvette and transformed into the bacteria by 
electroporation using a BioRad Gene Pulser II unit (BioRad, 
Hercules, CA) at 25 uF and 200 n. 0.96 ml SOC medium (0.5% 
yeast extract, 2% tryptone, 10 mM NaCl, 2 . 5 mM KC1 , 10 mM 
MgCl 2 , 20 mM MgS0 4 , 20 mM glucose) was mixed with the cells and 
transferred to a 1.5 ml microfuge tube. The sample was incu- 
bated at 37°C, 225 rpm, for 1 hr. Aliquots of the cells were 
spread onto an LB agar + kanamycin and incubated at 37°C for 20 
hr to estimate the number of transf ormants obtained. Six kana- 
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mycin resistant colonies were confirmed to contain an insert of 
the expected size as described in section A. 4. a. 

C. Identification of cosmids possessing polyketide synthase 
5 genes 

1. Colony blot hybridizations using cosmid library in pFD66€ : 

A 20 x 20 cm sheez of Duralon UV membrane (Stratagene) 
was placed on top of a 24 . 5 x 24 . 5 cm square bioassay dish con- 
taining 250 ml LB agar - kanamycin. An aliquot cf the frozen 
cosmid library in 1 ml IH medium was spread on the filter. The 
plate was incubated at 37^c for 24 hr. Colonies were replicated 
onto two fresh filters which were placed onto LB + kanamycin 
agar medium and incubated at 28°C for 18 hr. Lysis of cells and 
neutralization of released DNA was performed according to di- 
rections that were provided with the filters. The DNA was 
crosslinked to the filters using a UV Stratalinker 2400 unit 
(Stratagene) in the autc crosslink mode. Cell debris was re- 
moved by placing the fibers in a container with a solution of 
3 X SSC (20 X SSC contains, per liter, 173.5 g Nad, 88.2 g so- 
dium citrate, pH adjusted to 7.0 with 10 N NaOK) , 0.1% SDS and 
rubbing the lysed colonies with a Kimwipe. The filters were 
then incubated at least 3 hr with the same wash solution for at 
least 3 hr at 65°C. The plasmid library was treated similarly 
except cells were spread onto a 137 mm circular Duralon UV mem- 
brane placed on top of a 150 mm petri dish containing 80 ml LB 
agar + kanamycin. 

For hybridizations, a probe consisting of a 650-base pair 
(bp) polymerase chain (?CH) fragment representing a portion of 
a S. cellulosum polyketide synthase gene was used. The fragment 
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was amplified using primers to consensus regions of Type I 
(macrolide) polyketide synthase (PKS) genes (Swan et al . , Mol . 
Gen. Genetics 242, 358-362 [1994]). A series of sense and anti- 
sense oligonucleotides were prepared for PCR studies as indi- 
cated in the following table 2: 

Table 2 



Oligo- 
nucleotide 


I. DNA seauence (5'-> 3') 


Corr e sponding 
amino acid 
sequence 


120 

(sense) 


CGGT (C/G) AAGTC (C/G) AACATCGG 


KSNIGHT 


121 (anti- 
sense) 


GC ( A/G) ATCTC (A/G) CCCTGCGA ( A/G) TG 


HSQGEIA 


122 

(sense) 


GT(C/G)GACAC£C/G)GC(C/G)TGCTC(C/G) ! VDTACSS 

i 
i 

i 


123 

(sense) 


GG (C/G) AC (C/G) AACGC (C/G) CACGT (C/G) A ? 
T 


GTNAHVI 


124 (anti- 
sense) 


CCCTG (C/G) CC [C/G) GGGAA (C/G) ACGAA j 


FVFPGQG 



The selection of C or G where necessary in the third position 
of a codon reflects the very high overall G + c content of S. 
cellulosum (ca. 70%). Conditions for PCR were as follows: 0.01 
M Tris-HCl (pH 9.0), 0.CS M KC1, 0.003 M MgCl 2 , 0.1% Triton X- 
10C..200 of each primer, 2.5 U Taq DNA polymerase (Promega) , 
5.C* dimethyl sulfoxide {Sigma) , and 0.01 Mg of S. cellulosum 
chromosomal DNA in a 0.35 ml reaction volume. Reactions were 
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carried out in a Perkin-Elmer Model 480 Thermocycler (Perkin- 
Elmer Corporation, Foster City, CA) under the following condi- 
tions: 94°C, 1 min; 50°C, 1 min, 72°C, 1.5 min for a total of 30 
cycles. Each possible combination of sense and anti-sense prim- 
ers were tried. A 650-bp and 350-bp fragment was amplified us- 
ing cligos 120 + 124 and 123 +124, respectively. The sequence 
of the fragments were determined using the ALFexpress AutoRead 
kit to fluorescently label the DNA, which was analyzed on an 
ALFexpress sequencing apparatus (Pharmacia) . The data indicated 
both PCR fragments possessed significant homology to polyketide 
synthase genes of Type I antibiotics. The 650-bp fragment was 
chosen for hybridization experiments. 

The fragment was labeled with 32 P-dCTP using the NEBlot kit 
(New England Biolabs, Beverly, MA) and purified on a Bio-Spin 6 
column (BioRad, Hercules, CA. ) . Duplicate blots were pre- 
hybridized in 3 X SSC (1 X SSC contains 0.15 M sodium chloride 
and 0.015 M sodium citrate, pH 7.0), 4 X Denhardt ' s solution 
(100 X is 2% Ficcll [Type 400], 2% polyvinylpyrrolidone, and 2% 
bovine serum albumin [Fraction V]), and 100 /xg/ml sheared, de- 
natured salmon sperm DNA; all reagents purchased from Sigma 
Chemicals, St. Louis. The labeled DNA was heated in a boiling 
water bath for 5 min to denature the strands, cooled on ice, 
and added to the pre-hybridization solution. The filters were 
incubated for at least 18 hr in a roller bottle hybridization 
oven. They were transferred to new bottle, then washed two 
times in 2 X SSC, 0.1% SDS at 70°C for 30 min (moderate strin- 
gency) . The membranes were placed on Whatman 3 MM paper to re- 
move excess liquid, covered with Saran Wrap, and exposed to 
autoradiography film (Kodak X-OMAT hS) with two intensifying 
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screens. The cassette was placed at -70°C and developed at ap- 
propriate intervals. 

Approximately 100 colonies were seen to have hybridized on 
the duplicate filters. Fourteen of these were isolated from the 
master plate and grown in 4 ml LB + kanamycin medium for 20-24 
hr, 37°C, 250 rpm. Plasmid DNA was prepared using the standard 
alkaline lygis method and digested with restriction endonucle- 
ase Pstl. The digested DNA was electrophoresed on a 0.8% aga- 
rose gel in TAB for 3 hr at 100 V. Fragments were transferred 
to Duralon UV using the VacuGene XL vacuum blotting unit (Phar- 
macia) and the recommenced alkaline denaturarion protocol. Hy- 
bridization with radioactively- labeled PCR fragment and washing 
were carried out as described above. Two prominent types of 
cosmids were observed; one contained Pstl fragments of ca. 7.0, 
5.0, and 1.1 kb <pEP0cos5 and P EP0cos7) that hybridized to the 
probe; the other type had fragments of ca. S.O and 3.6 kb 
(pEPOcosS and P EPOcosl3; which were homologous to the probe. 
Restriction analysis confirmed that cosmids showing identical 
hybridization patterns had identical or overlapping inserts. 
PCR reactions using primers representing consensus sequences of 
Type I PKS genes were performed using the isolated cosmid DMA 
as template under conditions described above, except ca. 0.01 
Hg of cosmid DNA was included as template. Cosmids pEPOcosS and 
pEPOcosS amplified the S50-bp fragment seen when oligonucleo- 
tides 120 + 124 were used, while pEPOcosS and pEP0cosl3 sup- 
ported amplification of an 1100-bp PCR fragment with oiigos 122 
and 124. The latter fragment was sequenced and confirmed to 
possess strong similarity to Type I PKS genes. These data con- 
firm that the recombinant cosmids are related to each other and 
that all contain PKS-like genes. 



WO 00/22139 

PCT/US99/23535 

21 



10 



15 



20 

10 



25 



30 



20 

35 



40 

25 



45 



30 
50 



2, Colony blot hybridizations of plasmid library in P Zero2.1: 

A 137-mm circle of Duralon UV membrane was placed on top 
of a 150-mm containing 75 ml LB agar + kanamycin. An aliquot of 
the plasmid library (representing ca. 2/ 000 recombinant colo- 
nies) in 0.5 ml LB medium was spread on the filter. The plate 
was incubated at 37°c for 20 hr. Colonies were replicated onto 
two fresh filters which were placed onto LB + kanamycin agar 
medium and incubated at 37°C for 6 hr. The filters were proc- 
essed for hybridization as described in Section C.l. Out of 8 
positive colonies detected, one contained a plasmid with a DNA 
region not encoded by ei'her pEP0cos6 or pEPOcosS . This plas- 
mid, called Sau4, was characterized in more detail. 

15 3. Colony blot hybridizations of cosmid library 'in SuperCos: 

The recombinant E. coli clones from the microtiter plates 
(see section 4. b) were used to produce two identical sets of 
hybridization filters in order to identify cbsmids carrying PKS 
and PS genes. The recombinant clones were spotted onto 2 sets 
of 22 x 22 cm LB agar plates containing 50 M g/ml kanamycin. 
Each plate contained 384 clones therefore representing 4 micro- 
titer plates. The clones were incubated at 30°C overnight. Af- 
ter pre-cooling for approximately 3 h at 4°C, 20 x 20 cm Hybond 
IT Nylon membranes (Arr.ersham, Braunschweig, Germany) were 
placed onto the agar surfaces. After 2 min. the membranes were 
removed and placed for 15 min. on Whatman 3 MM paper (Whatman 
paper Ltd., Maidstone, England) soaked with denaturation solu- 
tion (0.5 N NaOH, i , 5 M NaCl) before they were transfered onto 
Whacman 3 MM paper saturated with neutralization solution (1 M 
Tris-HCl, pH 7.5, 1.5 K NaCl) . Subsequently the membranes were 
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placed onto Whatman 3 MM paper soaked with 2 X SSC (0.3 M NaCl, 
0.03 M sodium citrate, pH 7.2) for 10 min. The membranes were 
baked for 40 min at 8f5°C. Then, each membrane was overlayed 
with 5 ml Proteinase K solution {2 mg/ml Proteinase K in 2 x 
SSC) and incubated at 37 °C for 90 min. Finally, cell debris 
was removed by wiping the membranes with a Kimwipe pre-wetted 
with 2 X SSC. 

As we were seeking in particular to identify biosynthetic 
pathways containing both PKS and PS genes, the following hy- 
bridization strategy was taken: The screening was initially fo- 
cused on ketosynthase domains from type I PKSs and on the ade- 
nylation domain from PSs. Target-specific primers were used to 
amplify DNA fragments of the corresponding genes from chromoso- 
mal DNA of s. cellulosum by pcr. The fragments obtained were 
then cloned, sequenced and the deduced amino acid sequence com- 
pared to known ketosynthase and adenylation domains of PKS and 
PS, respectively. In a second step these PCR fragments were 
used as gene probes to detect recombinant cosmids of the S. 
cellulosum cosmid library. 

Oligonucleotides based on conserved amino acid sequences 
of ketosynthase domains from various type I PKS were optimized 
for myxobacterial DNA by comparison to a known myxobacterial 
biosynthetic gene cluster (Schupp et al . , J. Bacterid. 177, 
3673-3679 [1995]) resulting in primer 
KSIUp (5'- 

C/AIGIGA(A/G)GCI (A/C/T) (A/T) I (C/G) (C/A) IATGGA (C/T) CCICA (A/G) CAI 
(A/OG-3') and 

KSC1 (5'-GG(A/G)TCICClA(A/G)I{G/C) (T/A) IGTICCIGTICC (A/G) TG- 3 ' ) . 
PCR-primers TGD (5'- 

T(A/T) (C/T)CGIACIC-GIGA(C/T) (C/T) (G/T) IG (G/T) ICG- 3 " ) and 
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LGG (5*- 

A{A/T)IGA(A/G) (G/T) (Q/C) ICCICCI (A/G) (A/G) (G/C) I (A/C) (A/G) AA (A/G 
)AA-3') 

directed to genes encoding adenylation modules have been de- 
scribed by Turgay et al . (Pept. Res. 7, 238-241 [1994]). ?CR 
reaction mixtures with a final volume of 25 pi contained 0 . 1 /xg 
template DNA, 0.2 U Taq DNA- polymerase (Gibco BRL, Eggenstein, 
Germany), 5 Mmol dNTP, 5% dimethyl sulfoxide (Sigma), 1.5 mM 
MgCl 2 , 25 pmol of each primer and the appropriate reaction 
buffer supplied by Gibco BRL . Chromosomal DNA of S. cellulosum 
was used as template. Additionally, chromosomal DNA of Myxococ- 
cus fulvus was used with PS primers. Reactions were carried out 
in an Eppendorf Mastercyler Gradient (Eppendorf, Germany) using 
the following conditions: denaturation 30 s at 97°C, annealing 
30 s at 55°C, extension 60 s at 72°C for a total of 30 cycles. 
The formation of ca. 700 bp fragments using the KS primers and 
of ca. 350 bp fragments with the PS primers were confirmed by 
0.8% agarose gel electrophoresis. Fragments of independent PCR 
reactions were ligated into vector pCR2.1TOPO using the TOPO TA 
Cloning kit (Invitrogen, Leek, The Netherlands) according to 
the manufacturer's protocol and transformed into £. coli XL1- 
Blue. Sequencing of the resulting plasmids and analysis of the 
deduced amino acid sequence revealed three different KS frag- 
ments, designated pM00 8.4, pM008.6, pM008.7, one PS fragment 
(pAPsl) corresponding to S. cellulosum and one PS fragment 
(pDPsl) obtained with chromosomal DNA of M. fulvus. The PCR 
fragments were re-isolated by digestion wich ScoRI from the 
plasmids pM008.4, pM008.6, and pM008.7, labeled, pooled and 
used as gene probes in hybridization experiments as described 
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below. The same procedure was performed with the PS fragments 
of pAPsl and pDFsl. 

Hybridization with PKS and PS specific DMA probes (see 
above) was carried out using the DIG nonradioactive labeling 
and detection kit (Boehringer Mannheim, Germany) and performed 
according to the supplier's manual using buffer containing 50% 
formamide. The membranes were hybridized in plastic bags con- 
taining approx. 10 ml of hybridization solution at 3 9°C over- 
night. Unspecific binding of probes was removed by 2 wash steps 
with 2 x SSC, 0.1% SDS at room temperature for 20 min. and one 
stringent wash step with 0.5 x SSC, 0.1% SDS at €0°C for 20 
min. Detection of hybridizing DNA fragments was performed with 
the above mentioned system according to the manufacturer's pro- 
tocol using CSPD as chemi luminescent substrate. The signals 
were recorded by exposure of the treated membrane to Hyperfilm 
ECL (Amersham Life Science, Little Chalfont, England) which was 
developed in appropriate time intervals. 

71 signals were detected with the PKS specific gene probe. 
On the duplicate filters 3 5 signals were obtained with the PS 
specific gene probe of which 7 were already known from the PKS 
hybridization experiment. These recombinant cosmids harbored 
PKS- and PS-encoding genes. In order to corroborate these re- 
sults PCR experiments were performed with DNA of the 7 recombi- 
nant cosmids as template and PKS (KSlUp, KSD1) and PS specific 
primers (TGD, LGG) generating fragments of the expected size of 
approx. 700 bp and 350 bp, respectively (primers and reaction 
conditions. see above) . 

A comparison of the restriction fragment patterns of the 
DNA from the 7 recombinant cosmids carrying PKS and PS genes 
digested by BamHI facilitated an arrangement of the cosmids in 
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3 groups. They were represented by cosmids designated A2 and 
AS. The remaining croup was represented by pEPOcosS. Therefore, 
A2 and A5 represented gccd candidates for further DNA sequence 
analysis because they carry both PKS and PS genes. 

D. Random "shotgun" sequencing of recombinant cosmids and plaa- 
mids 

1. Library construction 

a. pEPOcosS, pEPOcosS, A5, and Sau4 : pEPOcosS and pEPOcos? 
were sequenced to completion, and contiguous sequence data and 
analysis for these overlapping cosmids is presented below for 
the »cos6 region" < C f. claims 7 and 9}. Sequencing of cosmid 
A5, pEPOcosS and plasmid Sau4 was taken to the point of large 
contiguous sequences (ccntigs) representing the S. celiulosuzn 
insert; sequence and analysis presented below (cf . claims 10 to 
15) . 

Randomly sheared libraries were constructed for cosmids 
and plasmids of interest using a protocol similar to that of of 
Fleischmann et al . , 1995 (Science 269, 496) and modified in 
Fraser et al . , 1995 (Science 170, 397). Briefly, Qiagen-column 
purified cosmid DNA (-10 fig) was sheared to a size of approxi- 
mately 2 kb and the DNA end-repaired using BAL31 nuclease. The 
DNA was gel-purified after electrophoresis through a 0.75% low- 
melting temperature agarose gel containing 0.5 M g/ml ethidium 
bromide in IX TAE buffer run at 80 V for 2 hours. The volume of 
the low-melt agarose gel slice was estimated by adding the gel 
slice to a microfuge tube and weighing, then 0.1 vol. of 3 M 
sodium acetate (pH 7) was added and the agarose incubated at 
60°C. The temperature was equilibrated to 37°C, and DNA ex- 
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tracted twice using an equal volume of buffered phenol (Life 
Technologies) . The aqueous phase was transferred and extracted 
once with an equal volume of chloroform, then ethanol preci- 
pated by the addition of 2 vol. cold 100% ethanol. DNA was con- 
centrated by spinning at 16,000 x g in a microcentrifuge. The 
DNA pellet was washed wich 1 ml 70% ethanol and resuspended in 
100 M l of o.ix TE. The DNA was ligated to Smal-digested, phos- 
phatase- treated pUC18 vector (Pharmacia), and single insert re- 
combinants isolated by gel-purification of the band containing 
vector plus a single insert, followed by T4 polymerase polish- 
ing, and a final intramolecular ligation of the vector-plus- 
single- insert DNA. This final ligation represents a library of 
highly random ca. 2 kb fragments that was used for shotgun se- 
quencing of the ca. 4 0 kb cosmids or ca. 10 kb plasmids. 

b. Cosmid A2 : Cosmid DNA with inserts of S. cellulosum was 
isolated by an alkaline lysis procedure and purified with Ma- 
cherey Nagel columns (Machery und Nagel GmbH und CoKG, Duren, 
Germany) using manufacturer's recommendation. Purified Cosmid 
DNA was sonicated, end- repaired using T4 DNA Polymerase (Boe- 
hnnger Mannheim, Germany) . After gel-purification fragments of 
a size of approximately 2 kb were ligated into Smal-digested, 
phosphatase -treated pTZ18R vector (Pharmacia) . The ligation 
represents a library of highly random ca. 2 kb fragments that 
was used for shotgun sequencing of the ca . 40 kb cosmid. 

2. Sequencing and assembly 

a. pEPOcos6, pEPOcos8, Sau4, and A5 : DNA (1 /U of 100 ptl 

total in the library) was transformed into E. coli by electro- 
poration (20 /il of Electromax DH10B cells from Life Technolo- 
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gies) and cells spread onto LB plates containing 50 M g/ml ampi- 
cillin. After growth overnight at 37 °C. transf ormants (ca 300- 
3000 CPU total, were tranfered to 96-well growth blocks and 
shaken overnight at 37°C in 1.3 ml LB medium with 50 «g /nl am . 
Picillin. Templates were prepared from these cells by an alka- 
line lysis procedure (Qiagen QiaQuick Turbo Prep) to yield pu- 
rified, double-stranded plasmid DMA. Cycle-sequencing of the 
Plasmid templates was performed using universal forward and re- 
verse primers and BigDye Terminator sequencing kits (Applied 
Biosystems). using the manufacturer's recommendations, then re- 
solved using an ABI377 automated sequencer. Sequences were ed- 
ited using Phred, then assembled into larger contiguous se- 
quences using Phrap (Phil Green, University of Washington, St. 
Louis, MO) . 

b. Cosmid A2: DNA (1 ul of 20 ul total in the ligation) 
was transformed into E. coli DH10B by electroporation and cells 
were spread onto LB agar medium containing 50 mg/ml ampicillin. 
After growth for 18 hr at 37<>c. transformants were transferred 
to 96-well growth blocks and shaken overnight at 37°c in 1.3 ml 
2x YT medium with 50 mg/ml ampicillin. Templates were prepared 
from these cells by an alkaline lysis procedure (Qiagen Qia- 
quick Turbo Prep) to yield purified, double -stranded plasmid 
DNA. Cycle-sequencing of the plasmid templates was performed 
using universal forward and reverse primers and Big Dye Termi- 
nator sequencing kits (PEBiosystems) or Thermo Sequenase fluo- 
rescent labelled primer cycle sequencing kit (Amersham Pharma- 
cia Biotech) using the manufacturer's protocols. In .the shotgun 
phase of a cosmid, identical amounts of samples were sequenced 
either by dye-primer or dye- terminator chemistries (Pharmacia, 
PE Biosystems). Data were collected using Licor and ABI 377 
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automated sequencers and assembled with the GAP4 program (Bon- 
field, Smith, Staden, Nucl . Acids Res. 23, 4992-4999 [1995]). 
Gaps were closed using custom made primers (MWG- Biotech) on 
plasmid templates or PCR products in combination with dye- 
terminators . 



E. Bioinf ormatic Methods 



1. Open reading frame (ORF) identification 

ORFs were identified m the P EPOcos6 region using the OMIGA 
1.1.2 (GCG 0.4D) program from Oxford Molecular Limited. Default 
values were used (Standard genetic code, all ORFs over 50 
bases) to generate ORFs; analysis of these results lead to the 
list of 14 highest qualicy 0 RFs as defined in claim 9. Other 
ORFs, genes, or genetic elements may be found in the P EPOcos6 
insert that have not yet been annotated. In addition to hand- 
editing of the OMlGA-ger.erated data, the MAGPIE automated ge- 
nome analysis tool : 

( frt tp ; / / genomes . rockef el 1 « r . edu/maanie/maarn a . ht- , m1 ^ 

was used to identify genes for all the sequenced cosmids and 
plasmids. ORFs identified in this manner are presented as both 
nucleotide and peptide files below. 

For cosmids A2 and AS, ORFs have been identified within 
the DNA sequences of A5 (contigs 10, 11, 12) and of A2 using 
the FramePlot analysis program from Ishikawa and Hotta (FEMS 
Microbiol. Lett., 174, 251-253 [1999] public available under 
[http://www.nih.go.jp/-jun/cgi-bin/frameplot.pl] which is based 
on positional base preference in codons typical for organisms 
having genomes with a high G + C content (Bibb et al . , Gene 30, . 
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157-166 [1984]). Default parameters using ATG and GTG as start 
codons were used. The deduced amino acid sequence of predicted 
ORFs were compared with protein databases (GenBank, CDS trans- 
lations, PDB, SwissProt, PIR, PRF) using BLASTP (Altschul et 
al., Nucleic Acids Res., 25, 3389-3402 [1997]). Additionally, 
high scoring amino acid sequences were analyzed using the Pfam 
program [http://www.sanger.ac.uk/Software/Pfam/], which identi- 
fied specific domain structures of the submitted proteins 
{Bateman et al . Nucleic Acids Res., 27, 260-262 [1999]). 

2 . BLAST searches 

3LASTP2 similarity searches were performed using the peptide 
files from the above ORF identification strategy as query se- 
quences. Searches were performed using the in-house Bioinfor- 
matics BLASTP2 (Version: 3LASTP 2 . 0al9MP-WashU) web page at the 
Bristol-Myers Squibb Pharmaceutical Research Institute (allows 
BlastN2, BlastP2, BlastX2, TblastN, and TBlastX searches). In 
addition, peptide files generated by the MAGPIE analysis were 
automatically searched using a FASTA algorithm. 

3. Best match and probable identification 

Analysis of the BLAST? 2 and FASTA output led to an assign- 
ment of a best match and probable function. The best match was 
usually the top scoring rr.atch, although sometimes another match 
was given because it was a more relevant homolog, or no match 
was found with a significance greater than >e-4. Probable func- 
ticr. represents the best estimate of function given the initial 
analysis of the BLAST data and the published literature regard- 
ing the best match, and nay not necessarily represent the true 
function of the gene product (hypothetical proteins are of un- 



WO 00/22139 



PCT/US99/23535 



30 

known function) . A higher probability score indicates a higher 
liklihood that the probable function corresponds to that of the 
best match; e.g., the polyketide synthase matches are all above 
e-100, and given the very high significance scores are presumed 
to function as polyketide synthases (as are the high scoring 
peptide synthetases) . 

The following is a summary of the sequence data from the 
pEPOcosS region, pEPOcosS, A5, Sau4 and A2 . 

a. Data from pEPOcosS region: 

Summary: A large PKS/PS cluster spanning multiple cosmids . 
An IS element (designated IS-Scl here) is found in the cluster 
- this may be a potential tool for genetic analysis of Soran- 
gium. 

Statistics: Sequence was assembled from over 2000 random 
sequences (forward and reverse reads of the ca. 2 kb cloned 
fragments derived) . 

47,713 nucleotides of contiguous sequence (no pFD666 vec- 
tor included) 

DNA sequence data are as defined in claim 7. 

Note: pEP0cos€_0RF7 sequences (cf. claim 9): the predicted 
N-terminus of ORF7 shows 145 nucleotide overlap with ORF6 . 

Note: pEPOcos6_ORF8 sequences {cf . claim 9) : >pEP0- 
cosS_0RF8.seq ("ORF9_up" in Fig. 2) 
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57.3% G+C 

Table 3 shows ORF data summary. Note: pEPOcos6 jDRFl . seq is 

truncated at its 5' end; correspondingly pEPOcos6_ORFl . pep is 
truncated at its N-terminus. 

b. Data from pEPOcos8 region: 

Summary: Two PKS genes found on a cosmid. A TnlOOO inser- 
tion is also found (occurred during E. coli propagation) . No 
peptide synthetase genes were found; one ?4 50 hydroxylase was 
identified. 

Statistics: 1952 random sequence reads from the pEPOcos8 
library were assembled using phrap, with 1024 of the sequences 
assembling into 57 contigs. 12 of these contigs were chosen 
(totaling 56,537 bp) which each contained >6 reads and con- 
sisted of about 1000 bp or more. The sequences of these 12 
contigs and the associated ORFs are given below. 

DNA sequence data from contigs are as definded in claim 
10. Table 4 shows more data. 

pEPOcos8 protein data are as defined in claim 11, i.e. for 
selected ORFs (polyketide synthase, peptide synthetases, or 
ORFs with high similarity to known genes) . 
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c. Data from cosmid A5 insert: 

Summary: A cluster =f PKS and PS genes found on the cos- 
mid. Other genes possibly involved in this secondary metabolite 
production include a downstream lipoxygenase gene higly simi i ar 
to eukaryotic orthologs. 

Statistics: 880 random sequence reads from the AS library 
were assembled using phrap, with 530 of the sequences assem- 
bling into 12 contigs. 3 =f these contigs were chosen (totaling 
41,556 bp) which each contained >100 reads and consisted of 
about 9000 bp or more. The sequences of these 3 contigs and the 
associated ORFs are giver, below. 

DNA sequence data from contigs are as defined in claim 12. 
Table 5 shows more data. 

Protein sequence data from selected A5 ORFs are as defined 
in claim 13 . 



d. Data from plasmid Sau4 insert: 

Summary: Insert contains PKS genes on two large contigs - 
most similar to the soraphen PKS gene from Sorangium. 

Statistics: 565 rar.aom sequence reads from the Sau4 li- 
brary were assembled usir.g phrap, with 84 of the sequences as- 
sembling into 18 contigs. 2 of these contigs were chosen (to- 
taling 6596 bp) which each contained >10 reads and consisted of 
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about 1000 bp or more. The sequences of these 2 contigs and 
the associated ORFs are given below. 

DNA sequence data from plasmid Sau4 contigs are as defined 
in claim 14. Table 6 shows more data. 

Protein sequence data from selected plasmid Sau4 ORFs are 
as defined in claim 15, 

e. Data from cosmid A2 



Table 7 shows ORF data s 



ummary 



F. Construction of suitable recombinant expression vectors 

1. Expression in Mycobacteria 

Heterologous expression of the ORFs shown in Figure 1 is 
performed by using a derivative of plasmid pSUP102 (Simon, R, , 
Priefer, u. , Punier, A. , Methods in Enzymology (1986), vol. 
118, pp. 643-659). In this plasmid the gene for chloramphenicol 
resistance is changed for a cassette comprising the gene for 
streptomycin resistance and the promoter element of the Tn5 
transposon. Short homologous genomic DNA segments from the host 
organism are ligated with the DNA sequences of Figure 1 and 
with efficient regulatory elements into, for example, the EcoRI 
restriction site cf the vector. Following amplifiction of the 
vectors in Escherichia coli the DNA is transferee by electropo- 
ration of the host cells or by conjugation with Escherichia 
coli S17-I (Simon, R . , Priefer, u. , Punier, A., Biotechnology 
(1983), vol. 1, pp. 784-791). 
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By means of the tetracycline or streptomycin resistance, 
respectively, mediated by the vector the host cells are checked 
for integration of recombinant plasmid DNA into the chromosome 
by homologous recombination. 

2. Expression in Streptomyces cells 

Heterologous expression of the ORFs shown in Figure 1 is 
performed by using bifunctional Strepomyces-Escherichia coli 
cosmids pKU206 and pOJ466. 

3. Expression in Escherichia coli cells 

Heterologous expression of the ORFs shown in Figure 1 is 
performed by using "bacterial artificial chromosomes", cosmids 
(for example Supercos, Stratagene GmbH, Heidelberg) and T7 ex- 
pression systems (Stratagene GmbH, Heidelberg; New England Bio- 
labs Schwalbach, FRG) . Expression of recombinant enzymes occurs 
in Escherichia coli cells consti tut ively expressing phosphopan- 
tetheinyl transferase required for the formation of holoenzyme 
polyketide synthetases and polypeptide synthetases. 
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Table: 7. ORF data summary from A2 insert: 
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Claims 

1. DNA sequence, the expression products of which cause an 
enzymatic biosynthesis, a mutasynthesis or a partial synthesis 
of polyketide or heteropolyketide compounds or are involved 
therein. 

2. DNA sequence according to claim 1, wherein the polyketide 
or heteropolyketide compounds are epothilones. 

3. DNA sequence according to any of the preceding claims, 
wherein the DNA is derived from myxobacteria . 

4. DNA sequence according to any of the preceding claims, 
wherein the DNA is derived from Sorangium strains. 

5. DNA sequence according to any of the preceding claims, 
wherein the DNA is derived from Sorangium cellulosum. 

6. DNA sequence according to any of the preceding claims, 
wherein the DNA is selected from the group consisting of: 

(a) the following DNA sequence: 

Seq ID No 1 (A2 cosmid) 

GGATCGCGGCGCCCTCGCGCTGCTCCTCGAGCGTGCGGAGGAACTCCCACGCCAGGCGCGACT 
TGCCGAGGCCAGGCGCGCCCACCACCACCACCGCGTTCGCGGAGGGCTCGTCGACGCAATGGC 
GCCACTCGGTCGCGAGCTGCGAGAGCTCGCGCTCCCGCCCCACGCAGGGCGTCGGCTTGCCGA 
GGAGCCGTGGGACGGCATCCGGCTCCTCCTTCGGGCCGCGAAGCCAGCACCCTCCGGGCCCCT 
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GTACCGTCTCGAAGCGGCTCGCGAGCAGGCTGGCCGTCGCGTCGTCGAGCCGGATCTCCGGCG 

GCGACAGGCCATCTCGCCCGGCGATGAGCTGCGCGACCCGATCGACCAGCTCGCCGACCGGCA 

GCCTCGCCTCGACCTCGGCCAGCCCTGTCGCGACGGACACGGGCACGCCTCCGAGCGCCGCCC 

GCAGCGCGAGGGCGCAGTGGGCCGCCCGTGTGGCGAGATCCGTGGGCGACTCGGCGCCGGACA 

GCGCGACGAGCCACCAGCGCGCTTGCAGCCGATCGAGGCGCCCGCCGTGGCGCGCCGCGATGT 

CCCGCAGCGCCTCGGCCCGCGCGGCGCCGTCGTCCTCCGAGAGCGTGGCGCCGGCCTCGGCGC 

CGCCGTCTTCGGCCAGGATGACGCACATCACCTTGCGCTCGGCCGTCGTGATCGCCTCGCCCG 

GCGCGGCCGGCGCCGCGACCGCGCTCGCCCCGATCGAGAGCCCCTCGCCGGCCACGGCGGCGA 

GCTCCGCCGCGGCGGCGGCGCCGTCGCGCGGCCGCTCTCCCGCGTTCTTCGCCAGCATCCGCG 

CCACCAGGCGCTCGAGCGGCTCCGGGATACCGTCGCGGAGCTCCCCGAGCCGCGGCGGCTCTT 

CCAGGACGACCCGCATCAGGAGCGCGAGCGCGCTGTTGCCGAGGAACGGCGGGCGCCCCGCGA 

GGCACTGGAACAGCACGCACCCGAGCGCGAACACGTCGGCCCGGGCGTCGACCGGCGCGTCGC 

CGCGCACCTGCTCGGGCGCTATGTACCCGGGCGTGCCGAGCACGGCCCCGGGCGACGTGAGGG 

TCGGCGCGAGCCGGAGGTGGCGCGCGATGCCGAAGTCGAGCAGCGTGACGCGCTCGACCGCGC 

CGCCCACGAGCATCAGGTTGCTCGGCTTGAGGTCGCGGTGAACGACGCCGAGCCAGTGGATCG 

CGCCGAGCGTCGTGGCCACGCGCGCGGCCAGCGCCACGCTCTCGGCCAGCGTGAGCGGCGCCC 

CGGCGAGCCGCTCCTCCAGGGTCACGCCGTCGAGCCACTCCATGGCCAGGTACGGCCGCCCTG 

CGCCGGTCACCCCGTGCGCCACGTACTGCACCACGCCGGGCAGCCGGAGCGTCACGAGCGCCT 

CCGCCTCCCGCGCGAACCGGCGCAGGTCGTTGGCGCTCGCGCCCTGCAAGACCTTGAGCGCGA 

CCGCCTGCCCGGACACCCGGTCGCGCGCCCGGTACACGTCCCCCATCCCGCCGGAGACGGCGA 

GCCGCTCGATCTCGAAACGATCCTCGATCACATCCGCTGCGCGCATGGCGGTGCCAATGTACT 

CCGCGCGAGCCTCGGGCCCCCGCGCGTAAGTGCGGCCCTGCGCCCGGTTGAACGCCAGCCCGA 

GCGTGACCGCCTCGCGCTCGGGATCCACGGCCGCCGGATCGGTCCACGCCTCGACGAGCGCCT 

GCGTTGAACAACCCGCCACCGGGCGCACGCAGCCGGCATCGCCGCGCTGGCCACCCGGCGCTG 

CCGCCCTTAGGCTCACCTCCGCGATGCCCCGCTGGTTCAACACGGCAGGTCCCTGCAACCCGG 

CCGATCACTACATGCTCCCGGCCGAGGAGCGCTTGCCCGCAGTGCGCGATCTGGTCGATCGCA 

AGGCCTACTTCGTCCTGCACGCCCCGCGGCAGATCGGCAAGACGACCTCGCTGCGCACGCTCG 

CCCAGGATCTCACGGCCGAAGGGCGCTACGTGGCCGTCCTCGTCTCGGCGGAGGTCGGCGCCC 

CCTTCTCTGACGATCCCGGCGCGGCCGAGCTCGCGATGCTCGCAGAATGGCGCGGCACCGCCG 

GCGCGCAGCTCCCCGCCGATCTGCGGCCGCCACCGTTCCCCGATGCGCCCGCCGGTCAGCGCA 
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TCGGGGCCGCCCTGCGCGCCTGGGCTCAGGCCGCGCCGCGCCCGCTCGTCGTCTTCCTCGACG 

AGGCCGACGCCCTGCGCGACGCGAC.GCTCGTCTCCCTATTGCGCCAGATCCGCAGCGGCTATC 

CCGACCGCCCGCGTGACTTCCCGCACGCGCTCGCCCTCGTCGGCCTGCGCGACGTGCGCGACT 

ACAAGGTCGCGTCGGTCGACAGCGGCAGGCTCGGGACGTCGAGCCCCTTCAACATCAAGGTCG 

AGTCGCTCACGCTGCGCAACTTCACCCGCGACGAGGTCGCAACACTCTACGCTCAGCACACGG 

CCGAGACCGGTCAGGTCTTCCGGCCGGACGCCGTGGACCGCGCCTTCGAGCTCACCCAGGGCC 

AGCCGTGGCTCGCCAACGCGCTCGCCCGCCAGCTCGTCGAGGTCCTCGTCAAGGACCGCGCGC 

AACCCATCACGTCTGCGAACGTCGATCGCGCCAAGGAAATCCTCATCGAGCGGCAGGACACAC 

ACCTCGACAGCCTGGTGGATCGGCTGCGCGAGCCGCGCATCCGCGCGGTGATCGAGCCGATGC 

TCGCCGGCACCGCGTTGCCGAGCGTGCCCCCCGACGACCTTCGTTTCGCGATCGACCTCGGCC 

TCGTGCGCATGACCGCGGAGGGCG3CCTCGACGTCGCCAACCCCATCTACCGCGAGATCATCG 

TCCGCGAGCTCGCGTTCCCGATCCGCGCCTCACTCCCCCAGATCAAGGCCACGTGGCTCACGC 

AGGACGGCCGCCTCGACGCGGACCGCCTGCTCGACGCCTTCCTCTCCTTCTGGCGCCAGCACG 

GCGAGCCGCTCCTCGGCGCCGCGCCCTACCATGAGATCGCCCCGCACCTCGTGGTGATGGCCT 

TCCTCCACCGCGTGGTGAACGGCGGTGGCACCGTCGAGCGCGAGTACGCCATCGGCCGGGGCA 

GGATGGATCTCTGCGTTCGTTACGCGGGCGAGACGCTCGCGATCGAGCTCAAGGTCTGGCGAG 

ACGGCCGCCCCGATCCCGTCGCCGAGGGGCTCGCCCAGCTCGACGAGTACCTGGCCGGCCTGG 

GCCTCGATCGCGGATGGCTCATCCTCTTCGACCAGCGCTCCGGACAGCCCCCCATCGCCGAGC 

GCACGCGCCGCGAGCGCGCGCTCTCCCCCGCCGGCCGCGAGGTCGCCGTCATTCGCGCCTGAG 

GGAGCTCGCCGCGCGGCGAGCGCCCTCCACGAGGGCCGGGCCACCTCGGACAGCGTCTCTACT 

CCTCCGAGGCCGCCGCGCCCCCCGCCCCGGCCGCCGCCGCCGCCGCCGGCTCCAGCTCGCAGC 

GCACCACCAGGACCTCGCCATCCGCGAGCTCCGGCCGCTCCACGAGCGCGTGCGCGCCCGCGC 

GCACCGCCGTGAGCACGTCTCCCAGCGCCGGCTTCAGCCGCGCCAGCGTCGCGGCGTTCGCCC 

CGAGCGCGAGGTCGGTCACGACGCGCCCCACGCTCGCGCCGAGCTCGCTCTTGCGCTTGTTGA 

CCGCCGCCATCGCCGCCGCCGCCAGATCCAGGAGCCCCGGATCCGAAGGCGCCGCGACCGCCG 

CGAAATCCGCCGCTGAAGGCCACTTCGCCCGGTGGATCGAGGTATCGCCCGTCTCCTCCGCGT 

ACACCCAGCGCCAGACCTCGTCGGTGATGTACGGCAGGACCGGCGCGAACAGCCGCAGCAGCA 

CCGACAGCCCGAGCCGCAGCGCCGCCACCGCCGAGCCGCGCGCCGCCTCCCCGGCGCCGCCCT 

CGCCGCGCGCCCGCGCCTTCGCGAGCTCCAGGTAGGCGTCGGTGAACCAGCGCCAGAAGAAAT 

CCTCGGTCCGCTCGAGCGCCGCCGCGAACTCGTGCTCGTCGAACGAGCGCGTCGCGTCGTCCA 
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CCACGGCCGACAGCTTGTGCAAGAGCGCCCGGTCGAGCTCCTCGGAGATCGGGTGGACCTCCG 

CCGACTGGCTGAGCACGTACTTGCTCGCGTTCCAGATCTTCGTGACGAGCCGCTTGCCGATCT 

TCAGCACCTTCTCGTCGAACGCCGTGTCCGTGCCGAGCCGCGCGCTCGCCGACCAGTAGCGGA 

CCGCGTCCGAAGAATACGTGTCGAGCAGGTGCATCGGCGTGACGACGTTGCCCTTGCTCTTCG 

ACATCTTCTTGCGATCCGGATCGAGGATCCACCCGGAGATCGCGACGTGGTGCCACGGGACCG 

ACGACTCGTGCAGCATCGCCTTCGCGATCGTGTAGAACGCCCACGTCCTGATGATGTCGTGGG 

CCTGCGGCCGCAGATCGGCCGGGAAGAGCCGCGCGTGGCGCGCCGGATCGTCCCCCCAGTGAG 

AGCTGATCTGCGGCGTGAGCGAGCTCGTGAACCACGTGTCGAAGACGTCGGACTCGGCGGTGA 

AGCCGCCGGGCTGGTCCCGCTGCGACGCCTCGTACCCGGGCGGCACGTCGACCGTCGGGTCGA 

CCGGGAGCATCTCGCGCGTCGCGAGCAGCGGCCGGCTGTGATCCGGGTTGCCCTCGGCGTCGA 

GCGGATACCAGACCGGGAACTGCACGCCGAAATACCGCTGGCGGCTGATGCACCAGTCACCCT 

GGAGCCCCTCGGTCCAGTTGCGGTACCGGAGGCGCATGAAATCCGGGTGCCACTTGATCTTGT 

CGCCGTATTCGAGGAGCTCGGCCTTCTTGTCGGCGAGCCGGACGAACCACTGCCGCGTGGGCA 

CGAACTCGAGCGGCTGGTCGCCCCGCTCGTAGAACTTCACCGCGCGCTCGATCGGCCTCGGCT 

CGCCCCGCAGCGCCGGCCCCCGGCCGGGCGCCGCCGCGTGCTCCTCGCGGCGGAGCAGCTCGA 

CCACCGCCGCGCGCGCCTGCTTCACCCCCCTGCCCTGGAGCGGCGCATACGCGGCGTTGGCCG 

CGGCCGGGTCGCGGCTCTCCCACGCGCCCTCGCCGAACGTCACCGGCAGGACACGGCCGTTCT 

TGCCGAGCATCTGCCGGAGCGGGAGCTTCTGCTCCCGCCACCAGATCACGTCGGTCGCGTCGC 

CGAAGGTACAGACCATCAGGATGCCCGTGCCCTTCTCGCGATCCACGAGCGGGCTCGGGAAGA 

TCGGCACCGGCGCGCGGAAGATCGGGGTGAGCGCCGTCTTGCCGAAGAGGTGCTGATACCGCG 

GGTCCTCCGGGTGCGCCGTGACGCCGACGCAGGCCGCGAGCAGCTCCGGGCGCGTCGTGGCGA 

TGACGAGCTCCTCGGCCGTCCCCTCCACCGCGAACGCGATGTCGTGGAACGCGCCCGATTGCG 

GGCGATCCTCGACCTCCGCCTGGGCGACCGCGGTCTGGAAATCGACGTCCCACATCGTCGGCG 

CGAAGACCGAGTAGAGGTGGCCCTTCTCGTGGAGATCCAGGAACGACAGCTGCGCCGTCCTGC 

GGCAGTGATCATCGATGGTGGCGTACTCGTTCCGCCAGTCGACCGAGAGGCCCACCCGGCGGA 

AGAGCGCCTTGAAGACCTGCTCGTCCTCGCGCGTGACCTTGTGGCAGAGCTCGATGAAGTTGG 

GCCGCGACACGATGCGCGGCGGCTCCTTCTTGATCGTCTCCGGCGCGGCCTGCGGCAAGGTCA 

GGCCGCGCTCGTACGGCGTGCGCACGTCGGTGCGGACGTGGAAGTAGTTCTGCACGCGCCGCT 

CGG7GGGCAGGCCGTTGTCGTCCCAGCCCATCGGGTAGAAGATGTTGAAGCCGCGCATCCGGC 

GCTGGCGGACGACGACGTCCGTGTGCGTGTAGCTGAAGACGTGGCCGATGTGCAGCGAGCCCG 
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AGGCGGTCGGCGGCGGGGTGTCGACGACGAAGGTCTCCTCGCGGGGGCGCGACGGGTCGTATC 

GGTACGTCCCGTCGGCCTCCCACAGGTCGGCCAGGCGCAGCTCGGCGACGGGCGAGTCGAAGT 

GCTTCGGGAGCGTCGCGGGATCGATGGAGCGGAACGTCTTCTTGATCGTCACGTGGTCACCTG 

CAGAACAGACCCCGCAGGAACCGCCCGCGGGGCCGGCATCCTACGTCGTCCCCCGGGTGCCGC 

TCAAGGCGCGCCGCGCCCGCGCGGCGGCGATCCGCGATCGCATCCGCGCATCCGCCAGAGCCC 

GGCGGCTCCGCCGGCGCGCGCGCGCCGTCCGTGGAGCCGAGAGGAGAGGCCGGCGCCCAGGTC 

GTGGAGGACGCCGGCGGCGCCGCCGCGGAGATCGCGGAGAGGCGGGCGCATCGATCGCGGCGA 

GGCCGGGGGCTCAGTCGTAGCGCTCGACGTGGACGTGCTTGCGGTGGACGCCGAGCTCGCCGC 

GGGCGAGCTCGCGGACGGACGAGACCATCCGATGCAGGCCGCAGATGAAGACGTGCGGCGCCG 

GATCTCCGCTCTTCTCCGCGAGCTCCCGGTAGAGCTCGGGCACGTGCGCCTGCACGTAGCCGC. 

GGCGGCCGGCCCACGACGGGCCGCCGCGCGAGAGCGTGATCTCGTAGCGGATCCGGTCGGATC 

CGCGCGCGAGCGCCTCGAGCTCGTCGCGGTAGATGACGTCCTCCTCGAAGCGCGCGCCGAACA 

GGATCCACAGGTGGGGCGCGGCCAGCCCCGCGCGCAGGGAGGCGCGCAGCATGCTCCGGAGCG 

GCGTGATGCCGGTGCCGGTCGCGACGAACAAGGAGGGCGCGGAATCCCCGGGATCGCGGGTGA 

AGAGCCCGTGCGGGCCGATGGCGCGGAGCGTGGCGCCGGGCTCGAGCCGGTGCAGGTGCTCCG 

AGCCCGCCCCGCCCTGCACGAGCGTGACCGCGAGATCGAAGCGGGGCGAGCCGTCGGGCGCGG 

ATGCGATGGAGTAGGCGCGCTTCACCTCGCCGCCCGGGAGCGGGAGGACGAGGTTGACCCACT 

GGCCCGCCTCGAACAGAAACGACCTCCCGTCGGCGCGCTCGAACGAGAGCTCGCGCACGAAAG 

GGCTGAGGGGCCGGGCGGCGACGAGGCGGGCTTCGAACGGTTCGGCGTGGATCATGGTCGGGG 

CCCGGCGGGGCTCGGCTGCGAGGCCGCGCGGGTGGCGAGGTCTTACCGCAGCCTGCGCCCCGG 

CCCAATCGCGATCGCCGCGGGAAGGGCGCCGCCGGAGGGCGCGCAATCGCGGGAATCACGGGC 

TTCCGCCCCGTGCGCCGCCGGAGCGCGCGGCCGGGCCGCCGGCCCGCGCTCCGGCGGGGAGCC 

GTCGCGGGCTCTACCGCACGCCCATGCGGCGGCGCTGCGGGATGTTCACCGCCGGCCGGGAGC 

GATCCTGGTTGGGGAGCGCGCGCGGCGGGCGGGGATCCCGGTGCGCGGGCTTCTGCGCGGGGA 

GCTGCCCTCGCTGAGCCGGGCGCTGGTCGGGCGACTTGGCCGAGCCCAGCGCGAGATCGGAGA 

CGGGGAGATGCGCGCGTCGCTGCATAGAATCCTCCATGGAATCGGTCATCAACACATCGGGAA 

GAGCACCCAGGCTGAAAGAAACCTTCGAAGAACCGGCTCTCATACACCCTCCATTCATCGTGC 

GACCCCGGATTCAGGACGGATCGAACCCGCGAGGGACGCTGGCTCTCTGGGCCTCTCCCTGCT 

CGCTCGACCGGCGCCCTCTCGACGCAACTCCGCCGTTCGTCGGGACGGGACGGTCCGCCTCGC 

CGCACGCTCCCCGTCGAGACGACTCAGCGTCTCGACGTCAGGAGAGATGACGACTCGGCCCGT 
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CGCGCCACGACCCTTCCGGCTCGG73CTTCGAGCGCGCGGCCAGCGAGCGAGGGGCGATCGCC 
AGGAGATCACGAATCTCCCGGCCATCGGCCTCCAGCGCCTCGGGCTCGTTCGCTCGTCGCCCC 
GCTCCGTCCCCGCGCGCGCACGACGCGAGCTCGCGCGGGGAACCGCGGGCCGCTGTCGTGGCT 
GCTGATGCGCGACGATACAGGGGGGACGCCGTGCCTACCTGGGCAACAGGCGCTCATCTTCTA 
CCACGGCGAGCACTACGGTGAGTGCTGCCATGAGTAGGCCCCTGAGGGTCCGCGCGACGGAGC 
GTGGTGTCAGCGAGAGATGCGCATGGTGGACGCGGGCTACGCGTCGAGAGGGACACTAGCACT 
CGACCTCGATCCTGCCCAGCACTTTTTGTCGGGGAGGGCTGCCCTCCCGCTGGCCGCTGGCCG 
CTGGCCGCTCGCCGCTGGCCGCTCGCCGCTGGCCGCTGGCCGCTCGCCGCrrGGCCGCTGGCCA 
TGTGCGACGTGAGCTCGAGCAGCCCGCGGCTGACGGACAGACCCCGGAGTTCATCGAGCCGGT 
GATGCCGAACCCGCCAAGCGAAAAAACGTATCCGTTCGGCAGGTCGTGGCCTATCATGCAAGC 
TGCTCGATGCGCTGACAGGCTTCTTCGAGATCCTCGTCGGTCTTTGCGAAGCAAAACCGCATG 
AAGCGACTCCCCTGCGTCCCTTCAAAGAAGGCGTCGCCTGGCACGCCCGCCACCCCGGTCTCG 
, TCGAGCAAGTAGATGGCTCGCTCTCGACCTGTCCTCCCGGGTAGGCGAGAOVCATCCGCCAGC 
ACGTAGTACGTCCCCTGCGGCACGCAGGGTGGCAAGCCCGCTTTCTCCAGCGCCCGACAGAAC 
CGGTCTCGCTTCCGTTCATATCCCTGGGCAAGCCCCGTGTAAAACGAGCGAGGAAGGCCGCGG 
ATCCCGGCAGCGACTCCATGCTGCAGCGGCGTCGGCGCGCAGACATACAGCAGGTCGCTCATG 
GCTCCAATGGCCTTCGCCCACCTGGCATCGGCCACGCTGTAGCCGATCCGCCATCCTGTGATG 
GTGAAGGTCTTCGAGTAGCCGCCTATCGTGATCGTACGCTCGGACATGCGCGGAAGGGAGGCG 
ACGCTGACGTGCTCACGGCCGTCGAAGATAAAGTACTCGTAAATTTCGTCCGTGATCACCATG 
AGGTCATGGTGGCAGGCGAGATCGGCGATCTGTTCCAGCTCCATTCGGCCGAACACCTTCCCG 
GAAGGATTTCCAGGAGAGTTCACCACGATCGCCTTGGTCTTCGGGGTGATCGCGCGCTCCAGC 
TCGTCGCCGTCGACATTCCAGCTCAGGGATCGCGCCGTCACATACCGCGGAACAGCCTCGACG 
GCGAGGATAGCCTGGGCGTGATAGGCATAAAACGGCTCGAAGAGCAGCACTTCGTCCCCAGGA 
TTGAGCAAGGCCATGCAAGTGGCCTGAAAGGCCCCTGTCGCTCCGGCGCTCACCGTGATGTCA 
GTCTCCGGATCCGCCGCGATGCCATTATGGCGAGCCAGCTTCGCCGCGATCGCATGGCGCAGC 
TCCACGATGCCGTCGAAGCGCGAATATGTATTGCACCCCCGATCCATCGCCTCCTTCACCGCT 
TGAAGGATCACCGAAGCyVACTGGGGTATCACAGACGCCCTGGGACATATTGATCCCATGGACC 
TTGGCGCACGCCAGGGTCATGGTACGGATATCGGACTGGGCGAGGCGAGCCGCACGATCACTC 
GGTAGACTCTTCATCAGCGTGCTCC7GCTTCTGTTCTGCGGCTCTGCATGGTGTCTTCGGGTG 
GGCTTGTCAGCTCGACGCGCCCATGCAGCGGCGCAGCCCTAGCGGCCGCAGGTCTGTCCACAC 
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TTCTTTGATGAAAGCGAGACATTC-3C-CTTTCGTGCCCTGTTTGCCCGCAGCCCTCCAGCCCCC 

AGGTACGGGCTTGTCGGCGGGCCAGATCGAGTACTGCTCTTCGCCGTTCACCACGACCTGGCA 

ACGCGTCTTGCTTTCGTCGTCCCGATTCATGATTTTCCTCGCCCTTCGTCAGCGCTGCGCGAG 

CATGAAACGAATCGCTCATCGGCGCACAGGCGCGCGCCGGCTGCCCGGAGGCACTCCCACGCC 

TCCCTCACGGCAACCTCATCGCTCCGGATGTTCCCGATGGCGACTCGGATCGTGTACCTGCCG 

TGGAGACGGGTATGGGACAAAAATACCCTGCCCGACTTGTTGACCTCGTCCAGCAGCGCCTCG 

TTGAGGCGATCGAGCTCGCGTTCGATCGACTCTCTCTCTGCCTCGTCCGCCGACCGCATCATG 

CAAGCGAGCGCGGAGGGCCTCATGC3AAAGCAGACCGTACTGAACGGCGTCGGCGCGAGGCGC 

TCCCAATCGGGATCGGCGTCCACCCACTGGGCGAGCTGCTGCCCCAATCGGAGGTGCTCCCGG 

ATCCGGGCCGCCAGCCCTTCATGCCCGAAGTAGCGCACGATCATCCAGAGCTTCAGCGCTCGG 
AAGCGCCGACCGAGCTGGATACC Z CAGTCCATGTAATTCGTGACGTCGCCCTCGGTGCGGAGG 

TATTCGGGCACCAGACTGAACGCGCGCTTCAGTCGGTCGGCGTCACGCACGTAGAGCACGCTG 

CAATCCATGGGGGTGAACAGCCACrTGTGAGGGTTCACTACCAGCGAGTCCGCCCCCTCGCAG 

CCCGCGAGCACGTCCCTGTGCTCGGGGACGATCGCGGCCATCCCCGCGTAGGCCGCGTCCACG 

TGAAGCCATAGCCCGTGCTCCCGGCAAACGCTGACGATGGCGGGGATGGGGTCGACGCTCGTC 

GTGGACGTCGTGCCCACCGTCGCCGCGACGCAGAAGGGTCGGAGGCCGGCCCCGAGGTCCTCC 

ACGACGGCGGCGCGCAGCGCCTCGGGGACCATGCGGAAGGCCGGATCCGTGGGGATCTTCCGC 

ACCCCCTCCTGCCCGATGCCGAGGGTGATGGCTGCCTTCTCGATGGATGAGTGCGCCTGCTCC 

C^CGCGTAGAGTCGCATGCGCCGCTGTCCCGCCATGCCCCGGAGCCGGATGGTCGGCTCGGCC 

GAC-TCGCGCGCGGCCGCGATCGCGACCATGCTGGCGGTCGACGCGGTGTCCATGATCGCGCCG 

TGCAAGCCGGCGTCGAGATCCAGCATCTGACGCAGCCAGGAGAGGACGAGCTCCTCGAGCTCG 

GTGGCCGCCGGCGACGTGCGCCATAGCATCACGTTGACGTTGAGGCACGCGGCGAGCAGCTCG 

CCGAGGATCCCAGGACCAGACGCCGTGTTCGCGAAATACGCGAAGAATCGCGGATGATTCCAG 

TGCGTGATCCCCGGCAGAATGATCTGCTCGAAATCGGTGAGCACGGCGTCCATCGGCTCCGGC 

TCGACGGGCGGGGTCGGGGCCAC-CCTGCCCTTCACGTCGCCGGGGCGGATCGCGGGAAAGACG 

GGGTATCGATCCGGGTGGCCGAGGTAATCGGCCGCCCAATCGATGATTCTCATACCGATCCGG 

CGGAACTCCTCCAGATCCATGTCCCCGAGCCGTTCTTTCCGCGGGTCGCTCACGTCAACCTCC 

TCGCCCTGCCAGGACAGGATCCTCGAGGTCCCCTGGCTCCGGCGGTGGAAAGCGCTCCTTGAA 

CGTGAAGGCCCACGGGGTCGGTCCC-TAGCGCCGCAGGTGCTCGAGCCGATCCTGCCCCTCGCG 

GACGGACGGGATGTGCCCGGCCGGGACCCACCACAGCACGAGGTAATGCGGCTCGAGATGCTC 
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GAACCACCGAGCGCGCTGTCGCAGGAACGCGGCATGATCCGCGGTGTAGGTGAAGGCGAACAG 

GTGCTCGATGGAGGTCCATACCGACAGGGTCACGAGGAGCCGCTGGTCCGGGTACGGACGGAT 

GGACACAGAGTTCCCCTCGGCCGTCTGCAGGCGCCACACGAACCCCTCGCTCCGATCGGCCAG 

ATGGTTGATATGGTCGAGCCCCTGGACGAAGCCCTCCATGATCGGATCCTCCAGCGGAGCGCG 

AATACATGCGAAGTTGTATTGCGCGATGTGGTGCCGATGCTCCGACATGTCGCTTTCCATCTC 

CAGCTCCCGCTCACCAATCCCAGCGCTGCTCCGGGGAGCTCATCAGGGCAGACGCGACATCGA 

TCCCGAAGCTCCGCCGCATCCCCTCGACGAAGGCGGCCTGGACCGCTTCGGCGACGGATCGGC 

CTGCCTCCGGCAAGACCTCGGAGACAAAGAAGAACCGCCTCGTGGAAGGGACAATCTTGCCCC 

GCTCCGCCTGGCGCCATACGAAGTGCCTCGTCACCAGTCCCTCCGCGTCGGCATACCCGACCT 

CGCCGGCGCCGACCGCCACGCTGCCGCCTGAGCCGAGCTCCACGAACGCCTCACCGCCTCGCG 

AGATCTCGAGGCGAACGTCCGGGCCAGCCAGATCGCCGAGGTCCCAAGCGCCGACGGGGACGG 

CGAACCGCAGCGACAGGAGGTTGTAAAAATCGACGAATGCGTTGATGTGCGGCAGCTCTCCAC 

CACCGAGGACCCGCTTCGCCAGCGCCTCGATCGAGCTCGGAAATTTCTTGCCAGAGACCCCCA 

CTCGCTTCATCGCCTCGCGCCAGGCAGCCACGTGCGGATGCGACTGGGCGTTTTCGTGGCCCC 

AGCTCCGTCGCAGCTCCTCCTCGACCTTCCGGAGCTCCTCCAGCACGGCCGGCCGCTCTGCGG 

CGTTGTCCAGGCCTTCCCCGTACCCGGTGACCAAGATCATCCCAGGAAACGACTCCCAGATTC 

GCGGATCGACGATGAATGCCATGTGCCTCCTGCCCCTCGAGAGCGATCGCCTCGATCGACACC 

AGGCTGTGGATGCATGAGCCGGGCCGTGCGGACGCAGGACCCCGCTACTCATGGCTCTTCGTG 

GCCGATGAACAGGTCCTCCACCCGTCGATCGTGCTCGGTGCCCCGATCCGTCCAGTCCCACCC 

GCCGGCGACCGCGATGTTTGCACCCGAGACGTACGAGGCGCGGTCGGAGGCGAGGAACGCCAC 

AGCATCTGCGACCTCGCTGGCGCGCCCCAGGCGGCCCATGGGGACGCGCCGCTCCATCCACTC 

CTTCTGCGCGGGCGGAAGGTATCCGTTGTCGATGAGCCCTGGAGACACACAGTTGACCAGGAT 

TCCATGAGGCGCCTCCTCCGTGGCCAGGCTGCGCGTGAGGATGAGCACGCCGGTCTTCGCGAT 

CGAGTACGCCGCCACGTTCGGCGCGCCGCGGATCGCGTACGTGGGGCTCAACCCGATATTGAT 

GATCCGGCCGCTCTTTCGCTGGCGCATGCGCGCCACGGCCGCGCGACAGAGGTAATGAACGCT 

GCTCAGGTTGCTGTCCATGACGTTGCGCCATTCGTCGTCGGTCATCGCCGCAAGCGGCTTGAA 

GAAGAAGTCGCCCACGTTATTGACGAGGATGTCGATGGGGCCCAGCTGCGCCTCGACGCTGGA 

GAAGAGCTCCGCGGCCGCGTTGGGGCGGGTGACGTCGGCCTGCACCACCATGGTTCGTCGCCC 

GAGCGCGCGGATCTCGGCCGCCGTCTGCTCGGCCGCATCCTTGTTCGAATGGTAATTGACGGC 

GACGTCCGCGCCTTGCTCCGCGATCCGCAGCGCGATCGCCTTGCCAATTCCGCGCGAGCTACC 
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GGTGACCAGGGCGACGCGCCCGGCGAGCTCCAGCGATCGCGCCTGTGGCAGGGCCGGAGCAGC 

CTCCTGGTGGAGCTCGACGTCGACGGGGAGCTCCACGTGGTAGCTCGTCTCTCGCGGAGCCGC 

GCAGTACCTCTCGTAGAACGCCTCGAGGACGGGCTCTTCGCGTCGCATGATGTCCGCGTGGGA 

TTCGGCGCTGCGCCACGGATAGAGCACCAGGATCTCGTCGGGGCGCACGGTGCTCTGAAAGAA 

TCGCGCATGACCGCACCACCCCGGGTGTTCGTGGGCTCCTGGGCCGAG.CAGATCATCCATTTT 

TTGCATGATCTGCGTGGCCTCGCCCTCCATGCCGGGTTTGATGCGCCATCGCTCCATAACGAG 

GATCATGTCTTGGCTCCTGTTCGTCATCGCCGTTTCGATCTGGGGGGGCTGCCCGCGCTCTCG 

AGGGCGCGCCCCTTGTATTGGCCGCGGATGGTCTGGGTAGCGCTCGCGAGCTTTCGCTTGTGG 

GCGGCGTTCAGGCTTGCGCCTTGATTGACGAACCGCTCGCAGACGAATGCGTGCGATTCATAT 

GCGGTCGCGAGCGCCCACAGGTAGAGACGGTGGCTACGAGCGAGCTTGGGCGGAAGCGCCCTC 

AAGGCGGGTCGGATGGATTTGAGGGTGCGAATCAAGCGGGCGTGCTCTTCGAAGAAGAGACCG 

CTGAATCCATCCTGGACGAACGGCGGGGCCATGGTAGGTCGCACGATGTTGTTGTAATCATCA 

TTCGAGAAATCGCCGGTAAACCGGAAAGATGCCGCCGTTGCCCACCAGAGCTGCGTGGAGATC 

GCGAGCGCCTCTCGCGCCCTCGGAACGTCACGGTTGGCGAGCGCCTCCGACAGCCACCTGTGC 

GCCATGAGCAGCGCCTGGACCAGCACGAAGAACGCGTGATGCCCCAATACCCAGCGGCCCAGC 

GCGCCGTCCGGAGCGCCCGCCTCCGCCGGAGGCGCAGCCAGCTGCGGGATGCCATTCCATGAT 

TTTGGCCTTCGCTTGCCGGAGAACTGGTGGAGGATGTCCTCGATGGACGCACAGAGATGCCCC 

ATTTCCATGGGTTGCAGGGAAGTACCTTTCAGGCTTTCGCGGATCATTCGGTAATATGCGACG 

ATCACCGCTTCGCAGTACGTCTCGAGCGGGTCTGGGGCGGAGACCCGGTGCACATGGAAATAG 

GCGTCGTACTCCGCCTCCCGGTCCGACAGGCTCCCCGGGATGTCCGGATCTGCCGCTTGCGCC 

TGCCATCGCTCGAGGATGGGGACGGCGAGGGGGGCGAGATGCTCGGCGAGCACCGCCAGGTCG 

CCCTCCGGCGCATGCGCCAACAAGACCTCGAAGGCCTCGGCTGCGACCTTGGAGGTCTCGCCC 

ATTCCGACGGCCTCGATGGCTTGGGGCAGCGGTAGATGGATGGTATATTTAGCCATGATTTGC 

CCGAAGATTGCCGCTGCGTCGACAGATCTTTCGCGAGCCGGAACGCCATTTCCACTGCTCTGG 

CTCTCAATATTGAATTGAGCCCTGGCGACTGCCATAGGCCCAGTCGCTCGACACAGTGTACGG 

AGCGGCCCGATGCTTTCTCCTTTTTTAGTCCTGCACCGAATACTTCTGTTGGGCGCCAAAGAT 

CCCTTGCCGAGACTGTCCGGCGAGATGTCGTGTGCGAAGCGTCCGCACGTCCAGCGGGCCCAT 

GCGTTGCTAGAGCATAAAACGGTTCGATGCCTGGTCGAGAGGGAGACGCGAGGAGCCTCCCTT 

TGGGACGGATGAGGAATTTCGTGACCGAAATGTCGGCAGGAACAGCGGCGCAGAAGCGGCGCA 

TCGATGGGGAACCATGGGTTACGAAGACATTGATGATAATGTCGACGCAATCGCAATCGTCGC 
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GATGAGCGGCCGCTTCCCCGGCGCGA3AAACGTCGAGGAGCTGTGGCAGAAGCTCCGCGCTGG 
CGTGGAATGCGTCGTCACCTTCACAC-AGGCCGAGGCGCTCGCCGCGGGGGTGAGCCGCGAGAT 
GCTCGCGAATCCCAGCTACGTGCGCAGAGGCGCGCCGCTCGACGGCGTGGAGCTCTTCGACGC 
CTCGTTCTTCGGGTTCAGCCCGAGG3AGGCAGAGAGCATGGATCCGCAGCAGCGCATCTTCCT 
GGAGGTCGCCTGGGAGGCCCTCGAGCGCGCCGGTTACGACCCCGATGCCCATTCCGGGCCTAT 
CGGCGTCTTCGCGGGCAGCGCCCCGAGCGGCTACCACTCCCTGGCGCAGTCCGACCCGGAGAT 
CCTAGGCGCCCTCGGCCACTACCAACTGACGCTGAACAACGACAAGGATTATCTCACCACACA 
CGCCTCGTACAAGCTCAATCTGCGGGGCCCGAGCGTGTGCGTGCAGACGTCCTGCTCGACCTC 
GCTCGTGGCCGTGGTCATGGCCTGCCAGAGCCTGCTCAACCACGAGTGCGACATGGCGCTCGC 
GGGTGGCGTGGGGATCCATGCGCATCAGCGGAGGGGCTATCTGTATCAGGAGAACGGCATCTC 
TTCGCCCGATGGGCATTGCCGCGCCTTCGATGTGGCCGCCAAGGGCACCGTGGGCGGCAGTGG 
CATAGGCATCGTCGTCCTGAAGCGGCTCGCCGACGCGCTCGCCGACGGCGACCACGTGCACGC 
GGTGATTCGAGGAGCGGCGATCAACAACGACGGCTCGAGCAAGATCGGTTACACGGCGCCGAG 
CGTGCAGGGGCAGGCCGAGGTGATCGGCATGGCCCAGGCGCTCGCCGGCGTGGAGCCGGATGA 
CATCAGCTACATCGAGGCGCACGGCACGGGGACGCCGCTCGGCGATCCCATCGAGATCGCAGC 
CCTCACGCGCGTGTTCCGGGCGAAGACCGCACGAAGGCAGTTCTGCGCCATCGGCTCGCTCAA 
GACCAACCTCGGCCACCTCGATGCC3CCGCGGGCGTCGCCTCGCTGATCAAAACGGTCATGGC 
CCTCGAGCACCGCGAGCTGCCCCCGAGCCTGCACTTCGAGCGTCCGAATCCGAAGCTCGAGCT 
GGAGAGCAGCCCTTTCTACGTCAACACCCGCCTCACTCCGTGGCACGCGGCACGAGGTCCGCG 
CCGCGCTGGCGTCAGCTCGTTCGGCATCGGCGGCACCAACGCGCACGTGGTCCTCGAAGAAGC 
TCCGGCCCCGCCTCCGAGCGGCCCCTCGCGGCGTTGGCAGCTCCTCACCCTCGCGGCTCGCTC 
CGAGGCCGGGCTCGCGCGGGCCACGGCCGACATGATCGAGCACCTCGATCGCCACTCCGGCAC 
ATCGATCGCCGATGTCACGTACACGAGCCACGTGGGGCGCCGGGCCTGGCCCTTCCGGCGAGC 
GGTCGTCGGCGAGAGCGCCGCGGATCTCCGCGCCGCGCTCGCGAGCGAGGGCTCGCCGCGCTC 
GATCTCGTCATGCCAGGCGGCGAGG3AGAGGCCCGTCGTCTTCCTGTTCCCCGGTCAGGGAGC 
GCAGCACCTCTTCATGGCGCGGGAGGTGTACGAGGTCGAGCCGATCTTCCGGCAGTCCCTCGA 
CCGCTGCGCCGAGCTCCTGCGCGGCGCGCTCGGCCTCGATCTGCGGCAGGTCCTCTACCCCGC 
CGAGGGGCAGCGCGACGACGCCGAGCAGGAGCTCGGTAGGACCGCGATCGCCCAGCCCGCGCT 
GTTCGCCATCGAGCTCTCGCTCGCCAAGCTGTGGATGGCCTGGGGGATCGTCCCCCAGGCGAT 
GATCGGCCACAGCGTCGGCGAGTTCGCCGCGGCTTGTCTGGCGGGCATCTTCCGCGAAGAGGA 
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CGCGCTCCGCCTCGTCGCCGAGCGGGGCCGCCTGATGCAACAGATGCCGCCCGGCGCGATGCT 
1Q GGCGGTGCCCCTCGCGGAGCCCGAGCTCGCCCCCTACCTCAGCGACGACATCTCGCTCGCGGC 

GATCAACGGTCCGGCTCTCTCGGTGGTCGCTGGGCCGATCGAGGCCATCGACGCGCTCGCGGC 
CGAGCTCTTGGACCACGGGCTCTCGTGCCGGCGACTCCACACGCGGCACGCCTTCCACTCGAA 
5 GATGATGGCCCCCGTCGTTGACGCCTTTACCCGATGCGTGTCCGCGGTCGAGCGCCGCCCGCC 
IS GTCAGGCCACTTCCTCTCGACCCTGACGGGCGGCTGGATCTCCCCCGAAGCAGCGACCATCCC 

CGCATACTGGGCCCGGCAGCTCGTGGAGCCGGTGCGCTTCGCCCAGGCCGTGAGGCAGCTGCT 
GTCCGAGTCGACGTGGCTCTGGCTCGAGCTGGGTCCGGGCCAGACCCTGAGCCCGCTCGTACG 
GCAGCAGGCCCGCGCGGATGGCGGCCAGGTGGTCGTCGCCTCGCTGCCGCGCGCGAAGGACGC 
GGGCGCCGACCACCTCGCGGTCATCGAGGCGCTCGGCCGTGTCTGGAGCGCTGGTGGGACX3GT 
CGACTGGAAGCGCTTTCACGAGGGCGAGGCGCGGCGGCGGGTGCTGCTACCGACCTACCCCTT 
CGAGCGGCAACGATACTGGGCCTCTCCGCGCCACACGAGCGCTCCGCCGGAAGCGATAATCAA 
25 GCCGCTCCTCGCGAAGAACCCAAACGTCGCCGATTGGTTCTTCCTCCCTGCCTGGCGGCGCTC 

GGATCCTCCGGTCTCGTTCGACGCGCAGGCGGTGACCACGCGGCGCTCTACGTGGCTCGTCTT 
15 CATCGGGGACGAGGGCCTCGGCGCGGCGCTGGTGGAGGGCCTCGCGCGGCGGGGGCACGAGGT 
CGTCGCGGTGGTCACGGGTGAGAGGTTCGAGCAGACGGGCACGCAGCGCTACACGATCGATCC 
30 CGCCGCGAATGGCGATGTTGCGTCCCTCTTCGCGCGGCTCGAAATCGAAGGGCGCATGCCGGA 

CCGGATCGTCCATGCCTTCTGCACGTCGCCTGCGGACGGCGCGCGCATCGAGCGCGGAGCCGC 
GCTG3AGATCGAGCGCAGGCTGGGCTTCGATAGCCTCCTCCTCCTCGCCCAGGTGATCGCCGC 
ACAAAGGCATCCGAAGCCGCTGATGCTCGGCGTGATCACGACCCGGGCGCACTCCGTCATCGG 
AACC3AGATCATCGAGCCCCTGCGCGCTCTGGTGCTCGGCCCCTGCCGCGTCATCCCGCAAGA 
AATACCCCATGTCTCGTGCCGGAACATCGATATCGATCTCCCGGGCGAAGGCGGGCGCGCGGA 
GATCGCGGCGCGCCTGATCGCCGATCTGGAGCGAGAGTCGCCCGACTCGGTGGTGGCCTACCG 
CGGCGGCCGGCGCTGGGTCGAGAGCATAGAGCTCACCGATGTCGGCCGGCGGTCAGCTGGCGC 
CGCCCCGCGCCTCCGCCAGCGCGGGGCGTACCTCATTACCGGCGGCCTGGGGGGCATCGGCCT 
CGTGC-CTGCAGAGCTCTTGGCCCGAGAGGCGCACGCACGGCTGATCCTGGTTGGGCGGACAGG 
CCTC-CCAGCGCGGCAGGGGTGGGACGACTGGCTCGCGGCGCACGGCGCGGGCGACGCGACGAG 
CCGAAAGATCCTCCGGATCCGCGCC-CTCGAGGAGGCCGGCGCCGAGGTGAAGATCGCCGCGGC 
CGAC3TCTCCGATTTCAATGCGATGCGGAGCGTCATCGAGGAGGCCCGGACGCGCTTCGGCCG 
CATC3ACGGCGTCATTCACTCCGCCGGCATCGCGAGTGGAGGCATGATCCAGCTCAGGACGCC 
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GATGGCGGCTTGGCGCGTGATGGCGCCGAAGGTCGGCGGCACGCTCGTGCTCGATGCGCTCCT 

CCGGGACGAGCGTCCCGACTTCCTCCTGATCTGCTCGTCGTTGGCCTCGCTGGTCGGCGGCGC 

CACCCAGATCGATTACTGCGCCGCCAACGCCTTCCTCGACGCCTACGCGCAGAGCCGCGAGGG 

CGAGGAGGGATGCCGCGTCATCTCGGTGCAATGGGACACGTGGAGTGACGTCGGGATGGCGGT 

GGACTTCAAGCTCCCGGCCGATCTCCAAGAGGGGCGCCGCGAGAGCCTGAAGCGGGGCATCAG 

GTCGAGCGAGGGCGCCGAGGTGCTCGGCCGCATCTTGAGCGCAGGCATGAGCGGCCCGCTGGC 

GATTTGCACGTCGGATCTACCAGCGTACAAGCAGTCTGTCACGACACGCCGATCGCAGCACGA 

GCAAACTCCCGCCGCCCGGCCGATGCACTCGCGCCCAACGACCACGGGAGCCTATGTCGCTCC 

CGAGACCGAGACCGAACGGCGCATCGCCGCGATCTGGCAGGATCTCCTCGGCCTCGAGCAGGT 

AGGCGCAAACGACGATTTCCTCCAGCTGGGCGGCCATTCGCTGTTGGCCACGCAGGTCCTGTC 

TCGCGTCCTGCAGAGCCTCAAGGTGGGGATCTCGTTGCCGCAGTTCTTCGATGCGCCGACGGT 

CGCAGGGCTTTCGCGCCTGGTCGACGCAGCACGGGCCGAAGGCGCCGGACCCGTCGCGCCGGC 

AATCGGCCGTGTCGAGCGAGACGCCTACCGAATCAAGCCGCCCGCGGCCGAACAGGCCGCCCG 

CACCAAGCCGTAACAAGAAGGGGATCGAGTCATGGAACCCGTCGGCGGCGTGGACATGAATCA 

GCCCGCAAAGCAGCAGGAGACCTGCGTCTTCCCGACCTCCTTCGCGCAGCGGCGGCTCTGGTT 

CCTCGACCAGCTCGAGCCGGGGAGCGCCGTCTACAACATGCCCGCCTCCTTCCGGACGCGCGG 

GCCGTACGACGTCGACTCGCTCGTGCGCAGCGTGAACGAGATCGTGCGGCGCCACGAGTCGCT 

GCGCACGACCGTCGATGTCATGGATGGCGAACCCGTGCAGGTGATCGCCCCCTCGCTGCGCAT 

CGAGGTGCCCGTCGTGGACCTGAGCGAGATCGACGAGCCGGAGCGAGAGGCGGAGGCCCGGCG 

GCTCATGGCGGAGGAGAGCCGCCGCCCCTTCGATCTCACGCGAGGGCCGCTGCTCCGAGCCAA 

GCTGCTCCGGCTCGGCGAGGCCGATCACGTGCTGATCTTGACGATGCATCATATCGTCTCCGA 

CGGCTGGTCGATGGACGTGCTGTTCAAGGAGCTTTCCACGCTCTACGCCGCCTTCCACGAGGG 

CCGCCCGTCGCCGCTCCCGGAGCTGCCGATTCAATACGCCGACTTCGCGGTGTGGCAGCGGGA 

GCTGCTCCAGGGCGAAGTTCTGGAATCGCACCTCGGGTACTGGAGAGAGCACCTCCGCGGCGC 

CCCCACGCTGCTGGAGCTTCCGATGGACCGGCCCCGGCCGCCGGCGCAGACGTTCCGGGGCTC 

CCAGCGCGCGTTCCGACTCCCACTCTCCCXGCAACAGGCGGTGCAGGCGCTCAGCCGGCAGGA 

AGGCGCGACCCCCTTCATGACGCTGCTGACGGCGTTCAGCGTGCTGCTCTCGCGTTATGCGCG 

GCAGAGCGATCTGGTGGTTGGCACGCCCATCGCGAATCGCACCCGAGCAGAGCTGGAGGGGCT 

GATCGGCTTCTTCGTCAACATGCTGGCGCTGCGCATCGACCTCGGGGGCGACCCGAGCTTCCG 

CGAGCTGCTCGGGCGGGTGCGGGAGGTGACGTTGGGCGCCTACGCGCACCAGGACCTGCCCTT 
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CGAACGGCTGGTGGAGGAGCTGTCACCAGGGCGGAGCCCCAGCCACAGCCCCTTGTTCCAGGT 

GTCCTTCACGTTGCAGAACACCCCGATGGATGCGACGAACAGAGCAGACATTGCATCGGGTGG 

CGCGCCGCTGGTGGAAATGAAGGCGGCGAAATTCGATCTGATCCTGGAGCTCTCGGAATCGCC 

GCAAGGGTTGCTCGGCACGTTCGAGTACAACACCGACCTGTTCGACGCCGGCACCATCGAGCG 

GATGGCCGGCCACCTGGAGGTGCTGCTCTCCAGCGCCGTCGCGGCGCCGGATCGACCCATTGC 

GGAGCTGCCGCTCATGGGGGCCGAGGAGCGCAGTCGGGTATTGGTGGAGTGGAACTCCACTGC 

CGCGCTGTATCCCGAGGACCATTGCATGCACGAGCTGTTCGAGCAGCAAGTGGAGCGGTCGCC 

CGAGGCGACCGCGGTGCTCCTCCAGCAGCAGACGTTGACGTATCGAGAGCTGAACATGCGCGC 

CAATCAGCTCGCGCATCACCTGCGGAGCCTGGGCGTGGGCCCAGAGGTGCGCGTCGGGTTGTA 

TCTCGAACGGTCAATCGAGACGGTCGTGGCGATCCTCGGCGTGCTCAAGGCTGGCGGGGCCTA 

CGTGCCGCTCGATCCGACGTACCCCAGCGAGCGCCTCGGGCTCATGATGGCGGACGCAGCGCC 

CTCGGTGCTGCTCACGCAGGCGTCGCTCCTCTCGAAGCTGCCGCCCCACGGGGATGCAACGCT 

GGTACAGCTCGACGCGCTGCACGAAGCGCTCTCCAGGCTGCCACACCATACCCCGCGGAGCGG 

CGTCACCGCCCAGAACCTCGCATACGTCATGTACACTTCCGGCTCGACCGGGCGGCCCAAGGG 

CGTGCTCGTCGAGCACCGCGGCCTCTGCAACCTGCCCACCGTGCAGGCCAAGCTCTATGGAAT 

CGCGCCGGGCGACAGGCTCCTCCAGTTCGCGCCGCTCTGCTTCGACACATCGTTCTGCGAGAT 

CGCGCTCGCGTTGCTCTCGGGAGCGACGCTGGTCATGGGCACGGCGGACGAGCTTCTCCCGGG 

ACCTCCGCTGGTCGAGCTGCTGAAGAAGCACGCGGTCACGGCGATGCTCCTGGCCCCTACCGT 

GCTCGCAGCGCTGCCAGAACAACAGAGCGCGGCGTTGCCGCTGCGCGTGCTCACGATGGCCGG 

TGAGGCGTGCCCGGCGGAGCTCGTCAAGCGCTGGAAGGCACCCGGACGGCGCCTGTTCAACTC 

CTATGGCCCGACCGAGACGACCATTTGGGCAAGCTCCGCAGCGGACCTGTCCGACGAACGGAT 

CCCGCCCATCGGCCGTCCGATTGCCAATACGCAAATCTACGTGCTCGACGAAGCGCTCGAGCC 

GGTGCCCATCGGCGTGCCGGGCGAGATCTTCATCGGCGGCGTGGGCGTCGCCCGGGGATATCA 

CGGGCGTCCGGACCTGACGGCCGAGCGATTCGTACCCGACCCCTTCGGGCAAACCAAAGGGGC 

GCGCCTGTATCGGACCGGCGATCGGGCGCGCTGGCTGCCGGACGGAAACCTCGAGTTTCTCGG 

TCGAAACGACGAGCAGGTGAAGGTCCGCGGTGTCCGCATCGAGCTGGAGGAGATCCGCGCGGC 

GTTGCTCAAGCACCCGGCGGTCGCTCAAGCCGTGGCCGTGGTGCGCGAGGACACGCCGGGGGA 

CAAGCGGCTCGTCGCGTATGTCGTCGGACGCGGAGGAGCGCGCGTGACCGCCGCGGAGCTGCG 

CCAGTCCGTGAGCGAGCGATTGCCTGCGACCATGGTGCCATCGTCCTTCGTGGCGCTCGACGC 

CTTGCCCCTGACGCCGAATGGCAAGGTGGACCGCCGCGCGCTGCCGGAGCCCGAGCAGAGCGC 
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CC3GCGGCGAGGACCACGTCGCGCCGCGCAACGCCGTCGAGGAGGAGCTCGCCAGGATCTGGGC 

GAGCGTCCTCCGGCTCGAAAGGGTCGGCGTCCACGACAACTTCTTCGAGATCGGCGGCGACTC 

GATCCTGAGCATCCAGATCGTGGTGCGCGCGCAGCAGGCAGGGCTGCGCCTCACCCCGCGTCA 

GATGTTCCAGCACCAGACCATCGCCGAGCTTTCGACCGTGGCTAGAGCCGTCGAGGCGGTCCA 

CGTCGAGCAGGACCCGGTGACCGGTCCCGCGCCGCTCACGCCGGTGCAGCGCTGGTGGCTGGA 

GCAGGAGGCGGCCGAGCCGCACCACTTCAACCAGTCGATCTTCCTCGAGGTACGCGAGCGGCT 

CGACGAGAGCGCGCTGGAGCAGGCCATCGCGCATCTGATCGACCACCACGACGCGCTCCGGTT 

GCGCCTCGCGCGCGACGAACGCGGCGCCCACCAGGTCTTCGCCGCGCCGGGAGGCTCGACCCC 

ATTTCAGCGCGTCGACCTCGGGGCGCTGCCCAGCGCCGAGCAGATCTCCGCCATGGAGAAGGC 

CGCGAGCGAGGCGCAGGCGAGCCTCGATCTGGCCGCGGGCCCGGTCGTCCGCGCCGTGCTCTT 

CGACCTCGGCGAGGTCGCCCCGCAACGGCTGCTCGTCATCGCCCACCATATTGCGGTCGACAG 

CGTCTCCTGGCGGATCCTGCTCGACGATCTCTTTGGGGCCTATGAGCAGGCGCGCCGCGGCGA 

GGCCGTACGCCTGCCGCCCAAGACCACGTCGGTCAAGCGCTGGGCCGAGCTGCTGACCGAGCA 

CGCCGGCTCCGAGGCCGTCAAGGCGGAGCTCGGCTACTGGCTCGACTCATCGCGACGAACGGT 

AGCTCCGCTGCCCGTGGATCGACGGGCCGGCGAGGACGTGTGGGGCTCGGCGCGCCACATCGT 

CGTCTCGCTCACGCCGGAGCAGACGGAGCAGCTCCTGCGCGAGGTGCCGCAGGCGTACCGCAC 

ACGGATCGACGACGCGCTCCTCACTGCGTTCGCGCAGGCCATCGCTCGGTGGACGGGCTCGCC 

GGCGGTGCTCCTCGACCTCGAGGGTCACGGGCGCGAGGAGCTCGCCGGCGTAGACCTCACGCG 

CACGGTCGGCTGGTTTACGGCCATGTACCCGATCCTACTCCGCGTCGACGCGGCGGATGCGGG 

TGAGGCGCTCAAATCGATCAAGGAGCAGCTCCGCGCCGTGCCAGGCCGCGGGCTCGGCTACGG 

CTTGTTGCGTTACCTTCGGTCCGATACCATCGCCGAGGTCCGCGCGTTGCCGCAGGCCGAGCT 

CTGCTTCAACTACCTCGGCCAGCTCGATCAGGCGATCCCCGAGGCTGCACCGTTCCGGCCGGC 

GCGCGAGTATCAAGGCTCGGAGCGCAGCCCCGGCGCCCATCGCGCCCACCTCATCGAGGTGAA 

CGCGAGCATCGCCAATGGGCGCCTGTACGCCACGTGGACGTACAGCGAGCGCCGCCACGAGCC 

CGAAACCATCGAGCGCGTCGCGGCGAGCTTCGTCACGGCGCTCCGCGCGCTCATCGCGCACTG 

CACCTTGCCCGAGGTCGGCGGCAACACGCCTTCCGACTTCGACAAGGTGCGCCTGCGCCAGGA 

GACCATCGATGCTCTCGACGCAATCGACGCGGGCCCCGGGCCGTCTGCGAGGGGGAGCCGAAT 

CGAAGACGTCTACCCGCTCTCGCCGCTCCAGGAGGGCATCCTGTTCCACACGCTCTACGCCAC 

CGATTACACGGCGTATGTCGAGCAGTTCCACTGGACGCTGGAGGGCGATTTCGACGCCGAGGC 

GTTCACCCGCGCCCTCCAGGACGTCGTCGCTCGGCATGCCGCCCTGCGCACGTCGTTCGCCTG 
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GGAGCGCCTCGATGCTCCACTTCAGATCGTCCGCACGGGCGCGGTCCTCCCCGTCGAGCACCA 

GGACCTACGCGGCCTCGCCGCGGAGGAGCAGACCGCGCACATCTCCCGTTACGTCGAGGCAGA 

GCGCCAGCGCCGGTTCGATCTGCGAAAGGCGCCCCTCATGCGCGCCGGGCTGCTCCGGCTCCG 

CAAGGACGCCTGGTGCCTCGTCGAGACCATCCACCACCTGATCCTGGACGGCTGGTCGACACA 

AATCTTGCTCAAAGAAGTGTTCACGCTCTACGAGGCGCACCGCGGACACCGTGGGCATCTCGC 

GCTGGAGCTCGAGCAGCCGCGGCCCTACGGCGATTACATCGGCTGGCTCGCGAAGCAGGACCA 

GGTGCGCACCGCGGCCTTCTGGCGGCGCGAGCTCGAGGGCTTCTCCGCGCCGACGCCGCTCGG 

CGTCGACCGCGCTGTGCCGCACGACGACGGCGGCCCGCGGTTTGGTTGGCGCCGCATCGCCCT 

CTCGGGCGACGACGCGGCCCGGCTCGCCGCCTTCGCGCGTCAGCATCAGCTCACGATGAGCAC 

GCTGGTGCAAGGCGCGTGGGCGCTGCTCTTGTCACGCTACAGCGGCGATCCCGACGTGCTCTT 

CGGTATGACCGTCTCGGGCCGCTCGGCGCCGATTCCCGGTATCGAGCGCATGACCGGCCTCTT 

CATCAACACCATTCCGGTGCGCGTGCGCGAGCCTGCCGACGCGTCGGTGCTCGCGTGGCTCAA 

GGCGCTCCAGGAGCACGAGGCAGAGCTGCTCGAGCACGAGCACAGCCCGCTGGTCGAGGTCCA 

GGCCCATAGCGACGTGCCGCGCGGGACCCCGCTCTTCGAGAGCCTCGTCGTGTTCGAGAACTA 

CCCGGTGCAGGTCATCTTCGAGGCCCCTCCGGTCGAGGGGCCGACGCGCGCGGAGGAGGGCCT 

CCGCATGATCGATGCGCAGTATATCAGTGATCCACCGTATCCGCTGACGGTCGTCGCGGCCTT 

CCATGGGACGCTTTATCTCAATATTGGCTACGAGCGCCGCCGGTTCGACGACCAGGCCGTCGA 

ACGGATGATCGGGCACGTCACGACGCTGCTCCGGGGCTTCGTGCAGAGGCCCGAGACGTCGGT 

CCGCGATCTGCCGTTGCTGACGGCCGAGGAGGAGCGCACCCAGCTCCACGCGTGGAATGCCAC 

GC-CCGCGCCGTATCCCGAGGGCCATTGCATGCACGAGCTGTTCGAGCAGCAAGTGGAGCGGTC 

GCCCGAGGCGACCGCGGTGCTCCTCCAGCAGCAGACGTTGACGTATCGAGAGCTGAACATACG 

CGCCAATCAGCTCGCGCATCACCTGCGGAGCCTCGGCGTGGGCCCAGAAGTGCGCGTGGGCTT 

GTGTCTCGAACGGTCGATCGAGACGGTCGTGGCGATCCTCGGCGTGCTCAAGGCAGGCGGGGT 

CTACGTGCCGCTCGACCCGACGTACCCCAGCGAGCGCCTCGGGCTCATGATGGAGGACGCGGC 

GCCCTCGGTGCTGCTCACGCAGACGTCGCTCCTCTCGAAGCTGCCGCCCCACGGGGATGCAAC 

GCTC-3TACAGCTCGACGCGCTGCACGAAGCGCTCTCCAGGCTGCCACACCATACCCCGCGGAG 

CGC-C3TCACGGCCCAGAACCTCGCATACGTCATGTACACTTCCGGCTCGACCGGGCGGCCCAA 

GGC-CGTGCTCGTCGAGCACCGCGGCCTGTGCAATCTGCCCACCGTGCAGGCCAAGCTCTATGC 

AATCGCGCCGAGCGACCGGCTCCTCCAGTTCGCGCCGCTCTGCTTCGACACATCGTTCTGCGA 

GATCGCGCTCGCGTTGCTCTCGGGAGCGACGCTGGTGATGGGCACGGCGGACGAGCTCCTCCC 
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GGGACCTCCGCTGGTCGAGCTGCTGAAAAAGCACGCGGTCACGGCGATGCTCCTGGCCCCTTC 

GGTGCTCGCAGCGCTGCCAGAACAACAGAGCGCGGCGTTGCCGCTGCGCGTGCTCGCGATGGC 

CGGCGAGGCGTGCCCGGCGGAGCTCGTCAAGCGCTGGAAGGCACCCGGACGGCGCCTGTTCAA 

CTCCTATGGCCCGACCGAGACCACCATTTGGGCAAGCTCCGCAGCGGACCTGTCCGACGAACG 

GATCCCGCCCATCGGCCGTCCGATTGCCAATACGCAAATCTACGTGCTCGACGAAGCGCTCGA 

GCCGGTGCCCATCGGCGTGCCGGGCGAGATCTTCATCGGCGGCGTGGGCGTCGCCCGGGGATA 

TCACGGGCGGCCGGACCTGACGGCCGAGCGATTCGTACCCGACCCCTTCGGGCAAACCAAAGG 

GGCGCGCCTGTATCGGACCGGCGATCGGGCGCGCTGGCTGCCGGACGGCAACCTCGAGTTTCT 

CGGTCGAAACGACGAGCAGGTGAAGGTCCGCGGTATCCGCATCGAGCTGGAGGAGATCCGCGC 

GGCGTTGCTGAAGCACCCGGCGGTCGCTCAAGCCGTGGCCGTGGTGCGCGAGGACGCGCCGGG 

GGACAAGCGGCTCGTCGCGTATGTCGTCGGACGCGGAGGAGCGCGCCTGACCGCCGCGGAGCT 

GCGCCAGTCCGTGAGCGAGCGATTGCCCGCGACCATGGTGCCGTCGTCCTTCGTGGCGCTCGA 

CGCCCTGCCCCTCACGCCGAACGGCAAGGTGGACCGCCGCGCGCTGCCGGAGCCCGAGCGGAG 

CGCCGGCGGCGAGGACCACGTCGCACCGCGCAACGCCATCGAGGAGGAGCTCACACGAATCTG 

GGCCGACGTACTTGGGGCAAAGCGGGTCGGTGTGCACGACAATTTCTTCGATCTCGGCGGCCA 

TTCCCTGCTGCTCGTCCGGGTGCATGATCGGCTCGGCCAGCGGTTCGATCGGCCGCCCTCGAT 

GGTCGACCTCTTCACCTATCCGACCGTGGCGTCGCTCGCGCGGTTCCTTGGCGAACGGGCGAA 

CGGCAAGCAATCGCCGAGGGAGGCCGCGGCGGACGTCACGGAGCGCGGCCGGCGCCGCCTGGA 

GGCGCGGGCGCGGCGGGCGAAGGCCATCCGTGGCCCGACCTGACCCGGGCACCCTTCCAAGCC 

CCGCCGTTCCTCGCACATCCGCCGCCTCGAGCGCCGCGTCCAGCGCCGCCGTTCGCCGACGAG 

GAGGCGCGAGACGACGGTCCAAGGCCTTCGTGGGCTCTTTGCCCCGCAATCCGGAAGCTGCGC 




CTTGCGGCCCGTCCGCGATTGACCTCACATCCTGATCCCTTCTTGCGTCGTCCAGAAAGTGAT 

TGACGGCCAGCGCCGCGCTTGAGATCTTCCGGCGCGCGGCGATTTCATCGCTCCGGCGCGCCG 

TGACTGTCACCTGCGAAGGGATTATAATGAAACATAACATTGGGTGGCTTCTACCCGCCGCCC 

TCGCGACGCTTGCCTTCGTCCCGGCCTGCAGCCCGAATCACGGTGAGGATGCGCCCTCCGTGA 

CGTCAGCAGAGAGCGGCGCGGCGCCGAGCGCTGACTGCGTCGCGCTCGGGGCGAAGCTCCAGG 

CGGCGCTGGACGGCGCCGCCGCCGCGCAAAAGGCTCCGGGAGCCGCAGCGGCGGTCCAGAGCG 

GGGACTGTGTCTGGCGGGGCGCCACGGGCGTCTCGGACCTGGTCGCGAGCACGCCGACGAAGC 

CTGGAGATCTCTTTCGGATCGGCAGCATCACCAAGACCTTCGTCTCTACGCTGATACTCATGC 
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TCCGGGCAGAAGGCCGGTTGTCGCTCGACGACGCGGTGTCGAAGTATGTGAAGGGCATCCCCG 

CCGGCGACCAGATGACGCTGCGCCAGATCCTCGGTCACACGAGCGGGCTCTTCGATTACACGT 

ACAGCCCGGCGCTCGGCCAAATGATCGAGGTGGATCCGACCCGCGCCTTCGCGCCGGCAGAGC 

TCATCGCCCTCGCCACGGCCGAGGCGCCGTATTTCGCGCCGGGCGCGGGTTTTCGCTATTCGA 

ACACCAATTACATCGTGGCCGGCCTGGTGGCCGAGGCGGTGTCGGGCGGGACGCTCGCCGGGC 

TGCTCCGCACGCGCATCCTAGACCCTGTGGGCCTCGCGCACACGTATCTGGACGGCGCCGAGC 

CGCCGGTCCAAGGGCTCATCCGCGGCTACGGCGACTACGGCGCGGGCTTGGTCGACATCACCG 

ACCAGCTGTCGCCCACCGAGGCGTGGGCCGCCGGCGCCCTGGTGTCGAACGTCGATGACCTCA 

ATCGCTTCTTTGCCCTGCTCATCAGCCACGAGCTGCTCTCGTCGGACGAGCTTCAGGACATGA 

CCACCTGGACCCCGACGATGTGGCCCCACGAGCCCGGATATGGCCTCGGCCTCATCGAGCGCG 

ATTCTGCGCTCGGCTCCCTCAACGC-3CACTGCGGAATCATCTGGGGCTTTCAATCGGCGTCGT 

ACGGGGTGCCCGGCCGCGGCGACGCGATCACCGCGCTCATCAACCGGAGCGACGGCGACGCAG 

CGCGGCTCGTCGACGAGCTCGCGAAGGTCGTGAAAGAGCGCTGATCGAGGCGGAATGGGAGCG 

CTTCGGCGGGTGGTGATGGCGCCCGGCGCTCAGAACGCGACGCGCAGCCCCGCGCTCAGCGGG 

CCTGCGCCGGGCGACGCGGCCACGGCGCCCGGACCGACGAGGAGCCGCGCGACGGCGGGCGCG 

CTCGGCGCGTCGTCTCGCCGCACCCGCCGCTTGCCGAACACGTAGAGCGGCAGGCCGACGGCG 

ACCCCGGCCACCCCGCCGAGCGCGGTGGCGATCGCCACCTCGGACGCCTCGGCGCGCGCGGCG 

CTGCTGTCGTCGTGGCTCGCGAAGACCAGCACCGCGCCGCTGAGGATGGCGGCGCCGCCCAGG 

GTCGTGAGGACGAGCCCCGAGATCACCATGACCGGGCTGTTCCACTCCGTCGTCCGCTCCTCG 

AAGTCGCGGAACGCCGCCCTCGCC3CCGCGAGCTCCAGCTCGATCCGGCGCTGCTCGGCGCGC 

CGCTCGTCCGCGGAGCCGATCTCC-TGGACGCGGCGGGCCTGGCCCTCCAGCGCCGCGATGCGC 

GCCTCGTGCGCGGCGGCGGTCTCCTCCCACGTGGCCCCTGGCGGGACCCCCGCCACGGCCGGC 

GCGACAGAGGGCGCCGACGCCGGG3TCGAGGCGGGCGCCGCGGGCGGCTCCGCGGCCACGGAA 

GGCGCCGCCGCCGCGGGAGGCGCGGGCGGCTCCGCGGCCACGGAAGGCGCCGCCGCCGCGGAA 

GGCGCGGGCGACTCCGCGGCCACGGAAGGCGCGGCCGCCGCGGGAGGCGCGGGCGGCTCCGCT 

GCAACGGAGAGCGCGGGCGCCGCGGCAGCCAGGCCCAGCGCCCACGCGACGACACGACGGCGC 

GCCGCAACCGCGCGCGGGCGCGCGAAGCGGAGGTGGACCTGCTCCATGCGCGCAGCGTCGCCC 

CTCGACAGGGCCGGGTCAAGGCGC3GGAGTCCGAGAGCACGAGACCTCCGCGCCGCAGGAAAC 

AGGCGCGCCGGCGGCCCGCGCGGC3GCTCGCCGCTCACCCCTCGCGCGGCCGGCCGCGGCGCC 

GCCTCCCCTCCCCGGCGGGCCGCGCGTCGGCGGCCACGCGGAGCAGCTCCTGGAAGTGCCGCT 
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CCACCGGGCCGAGGTCGATGCCGTCCATGAACGACGTGAACGCGAAGTACGGCAGCAGCGTCT 

GCCAGGCGCGCAGCCAGCCCGGGAG3TAGCGCGGCCGCTCCAGCACGCCGGCGGCGCGCAGCT 

CCGCCAGCACCGCGATGTCGTCGAGCGCGATCCGGCCGCACACGAGCAGCTCCACCTCGTCGC 

GCAGCCCGCCGCGAAGGACCGCGGCCAGGAAGGCGTAGACCTCGAGCAGCCCGGCCAGGTAGC 

AGGCGTCCTTGGTGAACGGCGCGCCGCCCTCGACGAGCCCGCCGCGGCACACGCGCTGGGCGT 

CGAAGTAGGCGTCGCGGCGCTCGGCGCCGCGCTCGCGCAGGTGCCGGTACAGGTCGAGGAAGC 

TCGCGCCCTGCTCGGCCATGTCCACGAGCCGCACCCGCTCGGCGAGCCGGGTGAGGCGGCCGA 

TGGAGAGCGAGCGGCTGTAGAGCTCGGCGAAGATGGCGAGCCCCTCCTGCGTGCGCGTGGTGC 

GGGGGCCGCCCGAGCGCAGGAACGCGCACCGCGGCTGCGCCGCGCCGTTGTGCGCGGTGAGCG 

CGTGCGTCTCGACCTCGTGGTGCCACAGCCCCTCGGCCTCCCACGCCGCGAAGGTCGCCTCCG 

GCCGGATGCGGACCCGGCTCATGCCGGCGACCACCTTGGCCGTGACGCGCGGGTCGACGGTGA 

TCTCGAGGTCGAGCCGCGGCGCCCC-3CCGGCCACGCGGGCGGCGAGCATGTCCCGGAGCGCGC 

CGGCGTCGAGCGGCTCCTCCTCGGGATCGCTGGCCTCGTCCCAGCCGTGGACGCGCAGGCGCT 

CGGTGAGGTGCTCGGCGAGGTCGATGTTCCTGAGCGAGCCGCCGAAGAACCGCGAGCGCGCGC 

CGCCGTAGAGCTCCTGCGACCGCGCGGAGAACGCGCGGGTGCCCGCGGCCTCGAGCAGCTCCG 

CGGCCTGGATCTGCGCGCGGACGTTGTCCCGCAGCCAGCCGAGCGCCGGCGCGTCCCCGTCGA 

TGGCCCCGAGGAGCTCGCGCAGCTCGGCGACGCGCCGCGCGAGGCCGTCGCGATCGACGCGGT 

ACTCGACCTCGGGGAGGCGGTCC7CGCCGGCGGCGAAGAAGCGCTCCTCCACCTCGCGCGGCC 

AGGCGATGTCCTCGAGCAGCTTGAGGGCCTTGCCCTCCGCCAGGCGGCCGCCCACCCGATCGA 

GCTGCTCCAGCACGGCGCGGTCGAT3CTCATCGAGCGCAGGATCGCCGAAACCGCGAGACGCC 

GGAACCGTCATTCCCTCGACGAGGCAGCGATTGCCATGTTCCGTCGCTTTTTGGAGCGCCGTC 

GTCGCGCTCGCCTGCGGGCTCCGGCGATCCAGCGCGGTTGCATGCAGCGAGGGTGTTCCGGGG 

CTGGCTCGAGAGCGTCCTTTGGCCCACACCCGAGACACGAATGCTCCGCGCCGAGCGCGGTTG 

ACCGTGGACCCGCCGGAGAGCCGATGATACGGTCCGGCCGATGTCGGAGAGTGTAGCTCAACT 

CGAAGAACACCGCGCGGCGCTCACC3GACACTGCTACCGGATGCTGGGTTCGGTGGTCGACGC 

CGACGACGCCGTCCAGGAGACGATG3TGCGCGCCTGGCGGAGCCTGGATAAGTTCGACGGGCG 

CTCGTCGCTGCGCACCTGGCTGTACCGCATCGCGACGAACGTCTGCATCGACCTGCGGGCCGA 

CCGCGCGCGCCGGGCGCGCCCCATC3AGGAAGGCCCGGTCGGCACGGTGGACGACGCGCTCGA 

GACGCGCCCGCGCACCCACTGGCTGGAGCCCGTCCCCGACGCGCACGCCCTGCCGGCGGACAT 

CGACGCCGCGGAGCGGGCGATGCTCC3CCAGAGCATCCGCCTCGCGTTCGTCGCGGCGCTCCA 
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GCACCTGCCGCCGAAGCAGCGCGCCGCGCTGCTGCTCACGGAGGTGCTCGGCTGGTCCGCCGC 

GGAG3TCGCCGACAGCCTCAACACCTCGGTCGCCGCGATCAACAGCGCGCTCCAGCGCGCGCG 

GGCGACGCTCGCGAGCCGCGATCTCGGCGACGCGCGCCCCTCGCTGCCGGAGCCGCAGTCCGC 

GCTGCTCGACCGCTACGTCAACGCCTTCGAGCGGTACGACGTCGACGCGCTCACGGCGCTGCT 

GCACCAGGACGCGACCCTGTCGATGCCGCCGTTCACCCTGTGGCTCCGCGGCCACGAGTCGAT 

CCGCGCCTGGCTCGTGGGCCCGGGAGCGGGCTGCCGCGGGTCGCGGCTCATCCCGACGGCGGC 

GAGCGGCTCGCCCGCGTTCGCGCAGTATCGCCCGGCGCCGGAGGGCGGCCACCGGGCCTGGGC 

GCTCATCGTCCTCGACGTCGCGGGGGACCGCATCGTCAGCATGACGTCCTTCCTCGACACCGA 

GACGCTCTTCCCGCGGTTCGGCCTGCCGCTCGATCTACCGGCGTAGCCGCGGGCGCCCTGCCT 

GCCTCGCCGCGGGTGCCCTGCCTGCCTAGCCGCGGGCGCCCGGCCTGGCCACGGGCGCCCGGC. 

CTG3CCACGGGCGCCCGGCCAGCGACGGGGCGACGATTTTTTTCTGAGCGACCGATGAGTCCT 

GACGGGGCCGGGGGTCTACGGGGGTGAATCCAACACGGAGGCACCCATGACCGTGACCATCGC 

CAGCATCGATCATCGTGACCAGGACCTCATGACCGGGCCCCAGGCCAAGGCGCCGGCCCGCGC 

GGCGGCGCCCGACGCGGCGCCGTCCAGGCGAGCCGTGTGGGCGGGCCGCGTCCTGAGCGGGCT 

GGCCACGCTGTTCCTGACGTTCGACGCCGCGGTGAAGGTGCTGAAGCTGTTCCCCGCGGAGGC 

GTCGACCGCCGAGCTCGGGTTCCCGGCGCACCTCGTCCCCACCCTCGGCTACCTCCAGATCGC 

TTGCCTCGTGGCCTACCTGATCCCGCGCACCGCGGTGCTCGGCGCGATCCTGTGGACCGGCTA 

CCTGGGCGGCGCGATCGCGATCCACGTGCGGGTCGAGAACCCGCTCTTCAGCCACACGCTCTT 

CCCCATCTACGTCGCCGCGTTCCTCTGGGCGGGGCTCTGGCTGCGCGACCGCCGCGTGCGCGC 

GCTGACCGCGAGCCCGTCGTCGCAGGGCCGATGAGCTTCACGTTTCACGAGAGTCCATCACGG 

TAAAAGGAGAAGCGAGCCATGACCACAAAGAACCCCCGCAAGCTCTTCGTCAACCTGTCCGTC 

CGCGACCTGAAGCGATCGATGGAGTTCTTCAGCAAGCTCGGGTTCGAGTTCAACCCGCAGTTC 

ACGGACGAGAAGGCCGCCTGCATGGTCGTCAGCGAGGAGGCCTATGTCATGCTCCTCGTGGAG 

TCGTTCTTCAAGACGTTCATGAAGAAGGAGATCTGCAGCACGAGCACGCACACGGAAGGGCTC 

TTCC-CGCTCTCGTGCAGCAGCCGGGCCGAGGTCGACGACATGGTGAAGAAGGCGGTCGCGGCG 

GGCGGGTCGCACGCGATGGATCCGCAGGATCACGGCTTCATGTACGGGTGGAGCTTCTACGAC 

GTGGATGGCCACCACTGGGAGGTCATGTGGATGGATCCCAAGGCGATCCAGCCGTAGCCGACG 

GGGCTGGGCGCGCCGCCTGGAAGAGCCCCCGTGAGGCGGGGAGGCGGGAGGATCACCGTCTTC 

GTA3CCCACAGCGATGCAGTATCCGTCGCGCTTCGTATCGAAGCACGGCTGTTACGGGCGCGT 

CAGAGCGCGTCGCAGGTGATGCCGAGCCGCAGCAGCGACACGGGCACGAGCGTGGCTCCGATG 
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GAGATGAGCCGAGTCTCGCCCATGGTCTCGGGGTCATGAATGGATGAGTAGGGGACTCGCTGC 

TTCGTCACGTCGTGCTCGACGGCGACGGCGAGGCCGAGCTCGAAGTGCACGGGGCCTGGACCG 

AAGATCCAGCTCGCCCCGGCGCGAGCCCCGACGAAAAGCGTGTCGCCGTCGACGCCAGGGCCG 

TCGTCCCAGCCGGGCGATCCCACCGCGGTGTAGGTGTGTTTCCCGAAGGAACCCGCGAGCGAG 

AGTCGAAGTCCGACCGGCGCTCGCCACGCGACGCCCGCTGTCGCGCCGACGCCGCCGAAGCTC 

TCCCCGAAAGGCTTATCCCCTGTCTCGATGAAGCCACCCACCTCGATGACGCTGATGCGGTAC 

GTGAGCGCGAGATTGAGGTGCACCCCAGCGCTGTCCGAGCCCGAGTAGAGGCCGGCGCCCACC 

TGCACGCTGAAATCCATGCTCGGCGCGGATCCGCGCGCAGGAGCGACGCCAGGGGCGCTGCCC 

TCCTGCGCGCGGGCCGTCCCGACGCAAAGAAAGAGGGCTGTCGCGAAGAATCCAAGCGAGATC 

GATCGAAGTGAGCGCATGTCGGGCCCTGGAGCATCCGCTGTACCAGGTGCGTCGTATTCATGC 

GGCGCGCCGCCGGGCGCCGCCGCGCTGGCCTGTCCGACGCGAGATCACGAATCCGCCATCGCT 

CCCCTGGGCCGCCGGCCGCTCTGGTTCGCCTGCGGGCGTGCGCCGGCGCTCGTGTGGCCCATG 

GCAACCTTGTCGCGGTGTCGCTCGAACAGCACAGAGAGTATCGCGTCCGCAACAACCGCGCGA 

CCCGGCGAGACGCTCGTGGGGCCCCCTGCCTCCCCACTTCATCATAACGCCATCAGGAGCACT 

CGACATTTCATTTCTTCACCTCCACTGGCTGAGGGCGACGGTGCTCGTCATCGGCCGGTTGCT 

CTGGCGGTTGCTCTGGCGGGGTTTCTGACGCCCGGAACTAACGCTTCGAGCGCTCCCCCTTGC 

TCTCCCGTTCCTTCAGCTCCTCCAGCAGGTCGTCGAGGCGCTCGTAGCTGCCTTCCCAGAAGC 

GGCGGTAGTTGTCGAGCCAGCCGCTGGCGTCCTCGAGCGGCTTGGCCTCGATCCGACAAGGCC 

TCCGCTGCGCGTCGCGGCCGCGCGAGATCAGGCCCGCTCGCTCCAGCACCTTGAGGTGCTTGG 

AGATCGCGGGCTGGCTCATCGCGAACGGCTTCGCCAGCTCGGTCACCGACGCCTCGCCGGACG 

CGAGGCGCGCGAGGATCGCTCGCCGTGTCGGATCGGCGAGCGCAGCGAACGTTGCGTCGAGGC 

GCTCGGACGGGGTCATTGCATAACTCCTTGGTATAAAAACCAGTTAGTTATACAACCTGGGGC 

CCGGGCGGTCAAGCCTCCAGGCGATGGCGGTTCGGCCCGGGGGCTCCGCTCGCGGCACGCGCG 

CCGCGCGGCTACGTGCGCGGCGCGGTGAGCACGTCCTGCAGCGTGGCGCCGACCACGGGCTTG 

GTCAGGTGCAGGTCGAAGCCGGCCCGCCTGGACCTGGCCTGATCGTCGGGCCCGCCGTAGCCC 

GAGAGCGCCACCAGGTAGAGCGCTTCGCCGCCGGGCGCGGCCCGCGCCCGGCGCGCGACCTCA 

TAACCGTCGATGCCGGGCAAGCCGATGTCCACGAAGGCCACCTCGGGGCGCAGCTCCAGAAGC 

TTCTTCACGCCCTCCAGCCCGTCCACCGCCACCGTCACCTCGTGCCCCAGCGCCTCGATGTAC 

GCCCGCATCACCCGGCGCACGTCCTCCGCGTCCTCCACGACGAGCACCCGGCGCCGGTCAGCC 

GCCGCCTCGGGCGCCTCGGCGCGCTGCGCCGGAGGCGGCGGCGGCTCGTCGCGCTGCGCCGGA 
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GGCGGCCCCTCGCGCGGCGGGGGC33CCCGGCGCTCGGGGCAGGCTGCGGCGCCGCCCCGGGG 

CCGAGCGGCAGGCGCACGGTGAACT2GCTGCCCTGGCCCGGCCCGGCGCTCGCCGCGGCCACG 

CTGCCGCCGTGCAGTTCCAGGAGCC3CCGCACCAGCGTGAGCCCGAGCCCCAGCCCGCCCGTG 

CTCCGGTCGATGGTCTGGTCGACCrGCGTGAACAGATCGAACACCTTCTCGAGCATCGCCGCC 

GGGATGCCGCGGCCCGTGTCGCGCACCCGCAGCACGGCCTCGGGCGCGCCGACCGCCGCCTCG 

CGCGTGAGGCGCACCGAGATCGAGCCCCCCGGCGGGGTGTACTTCGCGGCGTTGGTCAGGAGG 

TTCGTCACCACCTGCTCCAGCCGCC-TCGCGTCGGCCCGCATGCCGAAGTCCCCGGGCCCCACC 

GACAGCGACACGTCATGGCGCCGGC-CCTCGACGGCCGGCCTCACCGCGGCGGCGGCGCTCTGC 

ACCACCGCCGCGAGATCGACGTCCrCGAGGCGCAGCTCCACCGTGCCCCGCGTGATGCGCGAC 

ACGTCGAGCAGATCGTCGACCAGCC3CACGAGGTGGCCCATCTGCCGCCGCGCGATCTCCCGG 

TAGCGCGCCGACGCGGGCCCGTCC-rCGTCCGCGTCGTCGAGCAGCGTCAGCGACAGGCTGATC 

GAGGCCATCGGGTTCCGGAGCTCGTGCGCGAGCATCGCGAGGAACTCGTCCTTGCGCTGATCG 

GCGAGCTTCAGCGCCTCGACGAGCGCCTCCACGCGCCTCCGGGCGCGCACCTGGTCGGTCACG 

TCGAACGCGAACACGAAGACGCCCTCGACCGCCCCGTCGCGATCGCGCATCGGCTGGTAGACG 

AAGTTGAAGAACACCTCCTCCGTCGTGCCGTCGCCCCGGCGATCGAGCCGCACCGGGAGCTCC 

TTGCCGACGATGGGCTCGCCGGTGCGGACCACCGCGTCGAGGAGCTCCCAGATGCCCTGTCCC 

TCGAGCTCGGGGAGGGCGGCCCGGATGGGCTCGCCCACGAGCGATCGACCGCCGACGAGCCGC 

TGGTAGAGCGGGTTGACCACCTCGAAGACGTGCTCCGGCCCGCGGAGGATGGCGATGGGCCCC 

GGGGCCTGCATGAAGAGGTCGTTCAGGTACTGGCGCTGCCCCTCGGCCTCGCGCCGGCGGCGC 

GCGAGCTCGACGTGGATGCGGACCCGCGCGAGGAGCTCCTTCGCGGAGAACGGCTTCACGAGG 

AAGTCGTCGGCGCCGGCCTCGAGGCTGTCGACGCGCGCCTCCTCGCCCGCGCGCGCGGAGAGC 

ATCACCACGGCGACGCCGCGGGTGCGATCGTCGGCGCGCAGCGCCCTGAGCAGGCCGAAGCCG 

TCGAGCCGCGGCATCATCACGTCGC-TGAGCACGAGATCCGGCGGGTGGGCGCGGGCGCGCTCC 

AGGGCGGCCCGACCGTCGGCCACGGCCTCCACCGTCCACCCCTCCGCCACGAGCAGCCGCAGC 

GCGTACTCGCGCATGTCCGCGTTGTCGTCGGCGACGAGGACGCGCCCCGGCAGCCTCCCGGCC 

GGCCCCTCGCCCGCCGGCCGGGACC-CCGGCGCCTGCTCGCCGCGGAGCCACTGCGCGGCCTCG 
TCGAGGAAGGGCGCGGCGTCCCGCCCCCCCGCGGCCGGCGCCGAGGCCGGCGCGAC 

or its complementary strand, 
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lb) DNA-sequences which hybridise under stringent conditions 
to regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
mg to (a) and (b) because of a degeneration of the genetic 
code, 



(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

7. DNA sequence according to claims 1 to 5, wherein the DNA 
is selected from the group consisting of 

(a) the following DNA Sequence: 
Seq ID No 2 <>pEPOcos6 region) 

GGATCACCTGCGGCGCGATCGCCGACCTCGTGCTGGTGTTCGGCTCGCTGGATGAGAAGCCGG 
CGGCGCTACTGATAGAGACGGCGACGCCCGGGCTGCGGGTGGAGCGGTTGCGGGAGATGCTCG 
GCTTTCGGGCGGCCCACCTGGCGAAGC7GTCCTTCGACGGTTGCGAGGTCCCCGAGGCTCAGC 
TGATTGGCCGGCCCGGCTTTGCGCTGATGTATCTGGCCCCCTACGCCCTGGATTTCGGTCGGG 
TCAGCGTCGCCTGGGCCTGCCTGGGCATGATCCGCGCTTGCCTGGAGACCTGCGCACAGCACA 
TCCTCACCCGCCGCACCTTCGGCCACCTGCTAGCCGATCACGGCATGATCCAAACCCTGATCA 
CCAACCTGGGGATTCACCACCAGGCGACGCTGCTCCACACGCTGCAGGCCTGCCGCGCCAGGG 
ATCGCGGCGACGTGACCGCCTCCGAGGCCACCCTCGCCGCCAAATACCTCGCGTCGCGGACGG 
CGGTCCAGGAGACGACCAACGCGGTCCAGATCATGGGCGCGCTGGGCTGCGACGAGGAGGGCG 
CGATCGCCCGCCACTTCCGCGACGCCAAGACGACCGAAATCATCGAAGGCAGCAACCAGATCA 
TCGAGGCGCTGCTGGCCAAGAACATCGCCCGCGCCGGTCGCGACAACTATCGCCGCTTCCTCG 
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ATGCGGAAGTCGAGCCCGGTCGGGCCGGAGGCGCACCATGACGAGCGCGGTCCCGACGCGTCA 

AACCAGCCTGCTCGACGACTTCGAGCGCGTCGCCGACGTCGATCCAGAGCGGATCGCCGTCCA 

CGCGAGCGAGACGAGCCTGCGCTATGGCGACATGAATGCGCGCGCCAACCGCATTGCCCACGG 

GCTACGGGCGCGCGGGATCGGGCCCAATCAAATCGTGGCGGTGGCGATGGCCCGCACGCCCGA 

GCTGATGATCGTGCTGTACGGCATCCTCAAGGCCGGCGCGGCCTACATGCCCATCGCCCGCGA 

CGCGCCGCCGCTGCGCCGCGATCATATGCTGCGCGAGAGCCAGGCTGCTCTGATGATCGCCGA 

CGAAGAGATCGCGGGACTCGCGGCCCGGGTGCTGACGCCGGCCGACCCGTTCTTCGCGGCCAT 

GCCGGACCACAACCCCGAGCCGCGTCACGACCCGACCGACCTGATTTACGTCATCTACACCTC 

GGGCTCGACCGGCCAGCCCAAGGGCGTGGCCATGGAGCACCGCGCCGTGTGGAATCGCCTGAC 

TTGGATGCAGGCCCAGTATCCAATCGACACGCAGGACGTGATCCTCCAAAAGACGCCGATCGT 

CTTCGACGTGTCGGTCTGGGAGCTGTTCTGGTGGCCGCTGGCCGGCGCCTCGGTGGCCCTGCT 

GCCGCAATCCATGGAGAAGTTCCCCTGGGCGATATCGGCGACGGTGGCGCGGTGCGGGGTGAC 

GGTGATGCATTTCGTACCATCGATGCTGATGGCCTTCCTTCAGGTGGTGGCGGGGCGGCCCGA 

GATGGCGGACCAGATGAAGGGCCTGCGCTACGTCTTCTGCAGCGGCGAGGCCCTGGCGCCGGC 

CCACGTGTCAGCCTTTCAGGAGCACATCAACCGAGCGGGCAGCATCAGCTTGACCAACCTCTA 

TGGACCCACCGAGGCGGCGGTCGACGTCAGCTACTTCGACTGCCCGCCCGGCGCGTCACTCGC 

GCGGGTGCCGATCGGACGAGCGATCACCGGCATCCAGCTGCTGGTCATGCGCGACGGCGTGCC 

TCAGCCGCCCGGCGTCGAGGGTGAGCTCGCCATCGGCGGCGTTGGTTTGGCGCGCGGCTACAT 

CTCACGGCCAGACCTGACCGCCGACCGGTTCGTGCCGCATCCAGGCGGCGACGGCCAGCGGCT 

CTACCGCACCGGCGATCTGGTGCGCAGGGACGCGGACGGCGAGCTGGTCTTCCTGGGGCGCAT 

CGACCATCAGGTGAAAATTCGCGGTCTGCGCATCGAGCCCGGGGAAATCGAGGCCCAGATCAG 

CGCCCATCCCGATGTGGCCGACTGCGCGCTGATTATCGAGCAGGACTCGGAAACCCTGCCCAA 

GCTGACCGCCTACATTGTCGTGGCGCGACCGGGCTTGACCCGGAAGGCGCTGCTACAGTTCCT 

GGGCGCGCGGCTGCCCGACTACATGCTCCCGAACCGCTTCCTGACCCTCACGGAGCTGCCCGT 

GACCGCCAACGGTAAGCGCGACTGGCGCGCGCTGCTCGGCCCGCTCGAGACCCTGCCTCTCCC 

TTTCTCCTGAATCCAACCAATACGAGGGATTCATGTTACACCCGATTCCCACCGACCGTTTCG 

CCCTGAGCCGACCGCTCTTTCGCGGGTACCTCGCGCACGATCCGATCGTGCAGGGCGTGCTGG 

CGGGCGACCATCCAGGCTGGGTCCTGGTGGACCGCGAGCCCGAGCCGCGCACGGCGCTGCTGT 

GGGCCTTTTCCGATCGGCTCTTCTGCGTGGGCGCAGCTGACACGCTGACCCCGCACGCGCTGG 

CCGAGCTGTTCCACGACCGACTGATCCCCCAGGCCCGTAAGATCGGGCAGCCGTTTTTCCAGG 
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TTCAGGGCGAGACGGTCGACACCTGGTCGGACCACCTGCATCAGGTGTCGCCGCACGCGACAG 

TCTCCTTCCGCCAGGCATTCCGCTTCGACCGCGACCTCTTCGAGCGGCTGCCAACCAAGCCGG 

AGCTGGCAGAGGCGCGGCTCGTGCCAATCGAC3CGCGGCTGCTGGCCGAACAGGCTGATCTGC 

GCGAGCGGATACTGGCCTCCTGGTCCAGCGAAGCTGCCTTCCATGCGCGCGGTTTCGGCTTCT 

GCTACCGCGTAGGTGACCAGCTGCCGAGCGTGTGCCTGGCATCGCACGTAGGCGGCGGCGCGG 

CCGAGCTGAGCATCAACACCGAGCTCGAAGCGCGCAATCGAGGTATGGCAACGCGGCTGTGCC 

GGCGTTTCATCGCCGAATCGCTGCAGCGCGGCCTGACGCCTTGCTGGGGCACCGAGACCTTTC 

GCCTGCCGTCAATCGCGCTGGCCCAGAAGCTCGGTTTCATCCCGACCTTCACCTTCCCCACCT 

ACTGCTTCGCGACCGGCACCGAACAGCCGGACGACAACTTCCTAGGCGAGCTGTACTACAGGG 

AATCGCGCATCGCCGGAAGTGGGACCGATGAGCCGCAAGCGGTTCGGCTGGCGCGGGGTTGGA 

GCCTGGCCGGCGACACCGAGCGTGCCGCGAGCTTCGCCGCACGCGCCCTGGCCGAAGGGTGGG 

CCGGCCACTCGACTCTGGCCACCGATCCGGATTTCGCCCGATTGCGCGCCAGCGCCGCCTGGC 

CCCGCCTCAATGTCCCTTGAAAGGTCACGTGGACTCATGATGTCCCCTTGAAAGGTCACACTC 

CGAGTCATGATGATTTGTCACTCCCACCGCTTCATTTTCCTCCACGTTCCCAAGGTCGCCGGC 

ACAAGCGTCAAGGACGTCCTCGGCCAAGAGCTATTCCAGGAGGACCAGGTCACGTTCCAGATC 

GCfCCCAATCCCCACTACCCACCTGAATGGACTGCGCCTTACGAGGAGCACATTATTGCCGCT 

GAATTGAAGAGCCAGTTGGCGCCGGAAATTTGGGACGATTACTTCAAGTTCGCCTTCGTGCGC 

CATCCGCTCGACTGGGCGGTCTCCAATTACTTCTTCTTCCTGCGCGACCGCAAAGGCCATCCG 

GCCCACGAATTCCTGGAGCGGAAGGGCTTCGCCGGTACCATGGACATGTTTTTCGGAGCGGCC 

GGGCGCCATCCGCTGGTCGCCGGCATGCGCTTCAGCCAATGGGAGTTCTTGTGCGACAGCGAG 

GGCCGGACGCTGGTGGACTTCGTTGGCAAGTACGAGCGGCTCGAGCAGGACTTCGCCGCCGTG 

TGTATCCGCATCGGGCTGACCCCGCCCGACTTGCCGTGCCTCAACCAGACTCGCCACCAATCC 

TTTACCAGTTACTACGACGAGGCTTTGATGCGCCAAGTCAGCCGCGCGTTAGCTCGCGATTTC 

GAAATTTTTGATTATGCCTGAGGCGGACCCGTTGCTTCGCCACCGGTGGATTATTCGATAAGT 

TATTATATTTTCAGTTGATCATGTGAATGTCGATCCAGCCAACGAGGAGGATACCTCCGCGTG 

CGGCTATGGGGGCGCAGAGGTCACGACTACGTGTAGAAATTTGTCGAACACACCACTAGCTGC 

CAGCGATTGGGAGCTTTGACTTGAAGATGAAAGTGGACAAGCGGAATGTCGACGACATTCTCG 

GACTCACTCCGACACAGACAGGCATCTTGTACCACTACCTGCTGGACCCGCAGGCCGACGCCT 

ATTTCGAACAATTGACGCTGCACCTGGAGGGGCCGCTCGACGTAGCGCGCTTCCGCCGCGCCT 

GGGAGCGCGTGGTGGCGGCTCACGACCAGCTGCGCGCCGTGTTTCGCTGGCAAGGGATCGAAC 
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ACCCGGTGCAGATCATCCTCAAGCAGCACGTGCCGGACCTGGAGTTGGCGGAGGTCCCGCGGG 

ACGCCGATCCGGCAGCCTTCCTGGCGCAATGGGTCGCGGCCGACCGGGCGCGCAAGTTCGACT 

TCGAGACGGTGCCCTTTCGCATCGGCCTCTGCCGGACTGATACCCAACATCACGTGATGCTGC 

TCAGCAATCACCATATCCTGATGGACGGTTGGAGTACGGGCCTGATTCTGCGGGACTTCCTCG 

CCTGCTACGGCGACTCCGAAAACTGGCGGCCACGCACCCGAACGCACTTCAAGGCGTTCATCA 

AGTGGCACCAGAACCGGCCACGCCGGGGCGAGGAGCGATTTTGGCGCGACCTGTTGCGCGATG 

CGCCCGACGGCGGCTTTCCCCGCCTGGGCGTCGAAGAAGGCACCCGCCACTCGCTTGACTTCG 

GCGCCCGCAGCCGCGCTCTCGACGACCGCTTGACCCAAGGCTTGCGCGACATGGCTCGCGACC 

TCGACGTCACCCTCGCCGCGATGCTCCATACCGCTTGGGGCCTTCTACTCCAGCGCTACCAGA 

ACAGCTGCGAAGTGATATTCGGGACCACCGTTTCCGGCCGCAACGTCGAGCTCGCCGGCCTCG 

ACGAGGTGGTCGGCTTGTTCATCAACACGATTCCGTTCCGCTTCTCGGCCGCGGCCGCGACGA 

CGCCCGTCGAGGCCTTCCGTGCGGTACAGCGCAATCTGCTGGCGAGAAGCGAGTTCGAAGCCA 

CCCCGCTGGTGGACATCAAGGGCTGGAGTGGTCTCGGTCCGGGCGCGGAACTGTTCGACACCA 

TCCTGGTCATCGAGAACTATCCCTTGGACCGCGCTATCTTCGAGAGTGATTCCAGCCTGCGGT 

TGACCGACCACCAAATCTTCGAGCGCACCAATTACGGGCTGACCCTGACCATCGAGACCTTCA 

GCCGGTTGCACGTGACGCTAGCCCATCGCCGTGACCTGCTGGGCGACGCGGCCGCTGAGCGAA 

TGCTAGATCATTTCACCGGCCTGCTCCAAGCCATGCTGCGCTTCCCTCACCAGCCGTTCGCGC 

GCCTCGAGATGAAAAGCGAACACGAGGCCCACCGCGTCCTGCACCAACTCAACCAAACGCGTC 

AGCCGCTGCCGTCCCAATCGGCTTTCCACCAGTTGTTCTTCGAGCAGGCCCAGGCCGATGGGG 

CACGACCGGCGCTGTGGTGCGGCGCCACGCGCTGGACCTACGGCCAGCTGCTGGAACGTGCCC 

TGCGTCTGGCGGGACGGCTGCAGGAAGCCGGCTTCGCCCGAGGCGATGTCGCCGCCGTCAGCC 

TCGGCCCGGTTCCGGATCTGATTCCCGGTTTGCTGGGCCCGCTGTTCGCCGGCGGCGCCTACC 

TGCCGCTCGATCCCACCCTGCCGGCCCAGCGCTCGCGGTTCATCCTCGACGATGCCGGTTGCC 

GCTTCCTGATCAGCGACGCGCCACTCGCGGGGCCCACGCCGATCCATCCGGACCCTGCCGGCG 

CCAGCCCCGTTGACGTCATTTTTGCCTGTCAGGACGGCGCCGCGCAGCCCGCCTACCTGATCT 

ACACCTCGGGCTCCACCGGCCAGCCCAAAGGCGTCTGGGTTAGCCACCGCAACCTGATCAACT 

TCCTGACGGGCATGAGCGCAATCCTGCCGGTCGCGGCCGACGACGTGTTCCTCTCGCTGACTA 

CCGTGTCGTTCGACATTTTCGGGCTCGAGACGTGGTTCCCGCTCAGCCGCGGCTGCACGATCG 

TCTTGGGCACGCGCGCCGAGCAGTTGGACCCGGCCGCGGCTGCCAAGGCCATCTCCTGCCATG 

GCGTCACGGTTTACCAGGCGACGCCATCGCGACTCCAACTTCAACTGGAGCACCCCACATTTG 
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TCCGCGCCATCGGCTCCCTGACGACCCTGCTGGTAGGCGGCGAACCCCTCCCAGCCGAGCTGC 

TGCGGCGCGTACGCGAAGTGACCGATGCGCGTATCTTCAACCTCTACGGTCCCACCGAAACCA 

CCATCTGGTCCACAGCCGGGGAGGTCACCGCGGCGGACGTCCCGGATATCGGCCGCCCGATCG 

CAAATACCGGCGTTTTCCTTCTGGCGCGAGACGGCTCGATCCAGCCGCCGGGCCTGGTGGGCG 

AGTTGTGCATCGCCGGCGAGGGCGTGGCGTTGGGCTACCACCGACGGCCGGACCTGAACCGAG 

AACGGTTTCGCGAGATTCCGCCGGGCCGCCTGCCCTTTGCCGGCAAGCTCTACCACACCGGCG 

ACCTGGCCCGCTGGACCGAAGACGGACGGCTCCTCTGCCTGGGCCGTCTGGACGACCAGCTCA 

AAGTGCGCGGCCATCGCGTCGAGCCGGGCGAGATCGAGGCAGTGATGGCGCGCCACCCGGCGG 

TCACGCAGGCGGTGGTCGTCACGCGGCCGCGCAACGGCGAGCCGGTCTTGGTCGGGTTCTGGA 

CTGCGGAAGGTGAGCCGATGCCAGAGGAAGCGCTGAGCGCTTACCTGGCCGACCGACTGCCGA 

GCTACATGGTACCCGAACGGTGCArCCTCATGAAGGCCATGCCGCTAACCGGCAACGGCAAGA 

TCGACCGGCGCGCCCTACCCAATCCCTTCGCCTTGACCGAGTCGACCCGGCAGGCGGCGCCGC 

GCACCTTGGCCCGCACCGCCGGCGAGCATCGGGTTGCCGAGCTGTGGCAGGCCTTGTTGCGAC 

GCGAGGCGATCGGCTTGGACGAACCCTTTTTTCAGGCCGGCGGGAACTCATTCGGCTTGATTC 

GGCtTCACGCCAAGCTGGAATCCGCCTTCGGGAAGTCGTTCCCGATCACCGATTTGTTCCAGC 

ATACCAGTATTCGCAGCCAGGCAGAAATGCTGAGCGGCTCGTCCGTCGAGGCGCCGCTCGCGG 

GAGCCGTGCCGCAACCCCCGGCCGCCGCCGCCCAAGTTGCCTCCTCGGCAGCTAAATCCCCAG 

GGGAGCGCGGCGCGGCAGCGACGTCGAGCGGCCTGACCGCGCAACCGCCCCAACCCCACTTCC 

GGCCCATCGCCGTTATCGGCCTCGCCGGCCGATTCCCCGCCGCACCCGACCTCGACGCCTTCC 

TTGAACTGCTCACGGAGGGTCGCTGCGGCATTCGCTTCTTCAGCCAAGCCGAGCTGCGCGACG 

AGGGTCTCGACGCGAATCGAATCGCGTGTCATAACTATGTCCCGGCCAAAGGTTTCCTCGACC 

GGGCCGACCACTTTGATGCCGACTTCTTCGGCATCCCGCCGCGCGACGCAGAAATCACCGATC 

CGCAAATTCGGCTTCTGCTTGAGTGCTGCTGGAACGCGCTGGAGCATGCCGGCTACCCGCCCG 

GCGGCGGCGAGATCGGGCTCTTCGCCGGCTCCTCGGCCAACTATCACTGGCTCGAATACGTGG 

GCATTTCCGAGGAGAGCAGCAATCGATTCGCCGTCATGATTCAAAACGAAAAGGACTACCTGG 

CCACGCGGATCGCCTACCAGCTCGAfTTGAAGGGCATTGCCGTCACCGTGCAAACGGCCTGCT 

CGTCGTCGCTGACCGCGGTCGAGCTGGCCTGCGATGCGTTACACGCCGGCCGCGTGACCATGG 

CTTTGGCTGGTGGCGTTGGTCTGACCTATCCGTTGCGCGCCGGATACCTGCACGAGGATGGAA 

TGATCTTCTCCCCCGACGGTCGGTGCCGGGCCTTCGACGCCCAGGCGGCCGGCACGGTCTGCG 

GCAACGGTCTGGGCATGGTGGTGC7GAAACAGCTCGACGCGGCGCTGGCCGACGGCGATGCCA 
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TCCACGCTGTGATTAAGGGCATCGCGGCCAACAACGACGGCGCGGCCAAGATCGGCTACACGG 

CGCCCTCGCAGAACGGTCAGGCGCGGGTGATCCGCGCCGCCCATAGGCTCGCCCAAGTCGCGC 

CGGAGACCATCGGCTATGTAGAAGCCCACGGTTCGGGCACGCCGCTGGGCGATCCGATCGAGG 

TGGCGGGCCTGACCGAGGCCTTTGACAGCCCGCGTCGCGGCTTCTGCGCCTTGGGTTCGGTCA 

AGTCGAATGTGGGTCATTTGGATGCGGCAGCGGGCATCGCGGGTTTCATCAAGGCGGTGCTCT 

CGCTGTCCCATCGGACCCTGTTCGCCAGCCTCCACGTCGACACGCCCAACCCGCAGATCCCGT 

TCGCCGACGGTCCGTTCCAGGTCAACACGGAGACCCGGCCCTGGCCAGCTGCCGACCATCCCC 

GCCGCGCCGGCGTCAGCTCCTTCGGCATCGGCGGCACCAACGTGCACGCCGTCCTGGAAGAGG 

CGCCGCAGTTGGCCGAGCACGCGGGGCGGCGGCGCGAGCGGCAGCTGTTCCTGGTCTCGGCGC 

GGACTGCAGCCGATCTGGAGCGACGCACCGCGGCGCTGGTCCGCCACCTGGCCGCGCATCCGG 

ACCTCGCACCAGATGACGTTGCCTTTACCTTGCACGCGGGCCGCAAACCGATGACCCACCGTC 

GTTTCCTGGTCGCCGCCGACCTCGCGGAAGCCGCCGCGCGTCTGGCCGAGCCCGATCCAGTCA 

AATCCGCCGCGGCGCGCGCCGACCGCTGCCAGGTCTGGATGTTCGCCGGTCTCGGCTCTCAAT 

ACCCCGGCATGTGTGGCGGCCTCTATCGCACCGAGCCGGCCTTTCGCGAGCAAGTCGACCGCT 

GTTTCGACCTCCTCGCGCCGCGTTGCGATTTGAAGCCCTCGCTCTTCCCCGAGCCCGATCAGG 

CCATCGACGCATCAGCCCTCGCGGCCATCGACACCGCCCAGATCGCCGTCTTCGTCTGCGAAT 

ACGCGCTCGCACGGATGCTGGAAGGCTGGGGGCTGCGTCCGGATCGGCTGATCGGTTACAGTT 

TCGGCGAATACGTGGCCGCCTGCCTGGCCGGCGTCTTCTCCCTGCCCGACGCCTTGGCAATCG 

TCCGCGAGCGTGGCCGGATCCTGGCGGCGGCCGAGCCGGGCGCGATGGTCAGCGTGCCCCTTC 

CGGCCGAGCGCGTCGCGTCGCTGCTGGAGCCGCCGCTTGCCTTGGCCATTGACAACGGCCCCT 

CATGCGTGGTGTCCGGGCCGGTCGAACCGGTGCGCACCTTCACCGCTCGCATGAAGCGGGACC 

GGGTCTGGGTGACGCCGCTCCAGGCCGAGCGCCCGATGCATTCGCCGCTGATGGCCGAGGCCG 

GCGGCTCACTGCGCGCCATGTTGGCCGGGTTCCGCCTGAATGCGCCGCGAATCCCGATCTTAA 

GCAATGTTACAGGAACCTACCTAACCGACGAGCAGGCCCGAGACCCCGATTACTGGGCCCGTC 

ACCTGTGCGGCAACGTTCGCTTCGCCGACGGTGTGCGAACCTTGTTGGCCGAGCGCGATCCGG 

TGTTCCTTGAATTCGGGCCGGGCCGCGATCTGAGCTCCTTGGTGCGCCACCAGATGCCGGAAG 

GCGCCGACGAGCCGATCGCACTGATCCGTCATCGCGAAGATCCGGTGCGCGACGAAGACCTCC 

TGCTCGATGGCTTGGGCCGCTGCTTCCTGCGTGGGGCGACCCTCCACGGGCAGGCCTTGTACG 

CCGGCCGAGGCTGCCGCCGCGTGCCGCTGCCCGGTTACCCGTTCCAGGGTCCACGCTGCATGC 

CGGCCCGCGCCGGACTGCCCGGCCTGGCGCGACCGACCGTGGGAGCGACCACCATCAGCTACC 
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GACCAGCCTGGAAGCGC3GCGCCGCGCTTGGCGGCTGTCGAATCGCTCGCGCCGCAATCCTGGT 

TGGTATTCAGCGACGGCAGCGAATTGGCGGGCGAGCTGGTGGCCGGCCTGCGCGCTTCCGGTT 

GCGCGACCACCCTCGTCGAAGGTGGGCTGGCGTTCGCGCGCTTCGCGGGCGGCTTCCGCGCGA 

ATCCCCGCGAGGAACAAGATCTCGCACAGCTGTTCGCGACCCTGTCGGCCGAAGCGATGCTGC 

CCACCCACATCCTGCACCTGCTCAGCCTGCCGTCGCCGGAGCGCGACTCGCCGCTGGCGCGCC 

TGGAGCACCTCACCGAGCTGGGCTf CCACCATCTGCTGGCCCTGGCCCGCCAACTGGAGGCGG 

TCGGCGCCCCCGAGGTCCGCCTCGCCGTGGTGACAACCGGCCTGGCGGCGATTGGCGGCGAGT 

CCGAGCTGCGGCCCGAGGTCGGGCTGTTGCGGGGACCTGTCCGCGTGATTCCCTTTGAATTCC 

CGAACTTGCGGCTGCGCCTGATCGACCTCGACTCGGCCGATCCCATCTGGCGTAGCGGTTGTG 

AGCCGTTGCTGCGCGAAATGGGCGCTGCCCCGGGACCTGAAGAAATCGCGCTGCGCGGCACCA 

GCCGTTGGGAGTTGGGCTACGAGCCGGTCGAGGGGGGCACCGTGAGCACCATCTCCTCGCGAC 

TGCGCGAGGGCGGCGTCTATCTGATCACCGGTGGCCTCGGCGGCCTGGGTCTGGCCTTGGCCC 

GTCACCTCGCCCGGAAGTACCGCGCCACCCTGATCCTCGCTGGCCGGCGAGGCGCGCCGGCGC 

GCGAGCTCTGGCACCAGGCGCCAGCGGAGTTCGTACCGGTCGCAGCTGCGATCGCACAGATGG 

AGGAGTGTGGCGCCCGCGTGATTCCCGTCGCGCTCGACGTCACCGACGCCGACCAAGTGAACG 

CGTTGTTCGCCACCATAGAAGCTACGGTCGGCAAGATTGAAGGCGTTTTCCACATGGCTGGCA 

TCGTTGACGGCGGCATCATTCGAACGCGCACGCGCGCTGCCAGCGACGCCGTGCTGGCGCCCA 

AAACGGTCGGAACCTGGATTCTCGATCGGGCTCTCCGCGGCGCCGGTGGCCGCTTCCTGGTGC 

TGTACTCCTCGATCAACGCGGTCGTCGCGCCCTTCGGCCAGGTTGCCTACGCCGCCGCCAACG 

CCTTCCTCGACGCCTTCGCCAGCGCCCACGAACACGACGAGCGTCTTTTCCGCGTCAGCATCG 

GTTGGGACACCTGGCGCGAGGCCGGCATGGCCGTCGATGCCGCCCGCGCCCGCGGCGACCAGG 

CCCCGCTCGAAGGGCTTAGCGACGAGCAGGGCTTGCGCCTGCTCGAAAGCGCCTTGGTCGGTT 

GCGAACCGCGACTCCTCGTCTCCATCAGCGAACTGCGCGCTCGACTAGCCGAGCATCATCGCA 

ACGGCGGCATTCCCCGGTTGCTCGGGCCCCGCGCCAACGAGGCGGGTGCAGCTGATTCCGGCG 

AGGAGGGCGCCACGCAAGACGCGTCGCCGGCCCGTCGCGCCCGTCCCGATCTGGTCGTGGCCT 

TCGCGCCGGCCGGCAACGAGCTGGAGCGCCGGATCGTGGCCATCATCGGCGCCTACCTGCGGC 

TCGGTCAGGTGGGCGTCGACGACAACTTCAACGATTTGGGCGCCACCTCGCTCGACCTCATCC 

AGATCGCCCAACGCCTCGGTCGCGAGTTGGGCCGCGATGTCCCTGTCGTCTCGCTCTACCAAC 

ACCGCACCGTACGCGGGCTGAGCCGCTTCCTCGGCGGCGCGCTCCAATCCGCGCGGTCCGGCG 

TCCCGACGGGCGCTGCCGCACCGG3CGCCGCCACGCCGGGGGTTGCCACCCCGCCGCGGCCAC 
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AACCGTCGCGCCAGCACCTGGAAAAACGCCGTCAATTGAGGAAAAAAGGGGGGCCTTCCCATC 

ATGAGTGAAGTATCCATTCGCCCCGGCTTGGACATCGCGGTCATCGGCATGGCCTGCCGCTTT 

CCCGGTGCCCGCAACCTCGCCGAC-TATTGGGCCAACCTGATCGAAGGCCTCGAAACGCTCAGC 

TTCTTCAGCGAAGAGGAGCTGCGTGAGGCCGGCTGCGATCCGGTCCAACTGGCCCAGCACAAC 

TACGTGCGCACCAAGGGCCTGCTCCCTGACGCAGACCGTTTCGACGCCGATTTTTTTGGTTAT 

TCCCCGCGCGAAGCCCAGGTGATGGACCCCCAGATCCGCGTCTTCCACGAGGTCTGTTGGCAG 

GCGCTGGAGCACGCGGGCTACAACCCGCATCGCCACACCGGCACGATCGGCCTGTTCGCCGGC 

GCCGCGCCCAACGTTTTTTGGGAGTTTCTCTCCTATCGGTCCGATGCCGCCAATTTAGGCAAC 

TTCACGCTGGGCCTGCACAACAACAAGGACTACCTGAGCTCGCGCATCGCCTACAACTTCAAC 

CTGACAGGGCCCAGCTACACCCTGTTCACCGCCTGCTCGACCTCGATGGTCGCCATCCACCAG. 

GCCGTCCAGGCGCTGCTCAACGGCGAATGCGACCTGTGCATGGCCGGCTCGGTCTCCATTACG 

CTGCCACTGGTTGCCGGCTACACCTACACGCCGGGCATGATCGTCTCGCCCGACGGCCATTGC 

CGCACCTTCGACGCAGGCGCCAATGGCACTGTCTACGGCGACGGGGCCGGCGTGGTCGTTCTC 

AAGCGGGCCGAGGATGCGTTGGCCGACGGCGACCACATATTTGCGCTCATCAAGGGCTCGGCG 

CTCAACAACGATGGCAGTCGCAAGACCGGCTACACCGCGCCCAGCGTGCAGGGGCAGGTGGAG 

GTGATCCGCGCGGCGATGAACCTGC-CGGAGGTCGAGCCGGAGGCGATCAGCTACGTGGAAACC 

CACGGGACGGGCACCACGGTGGGCGATCCGCTGGAGTTCGAGGCGCTAAAGGAGGCCTTCGGA 

GGTGGCTGCAAGGCCTTCTGTGGATTGGGTTCGGTCAAGCCGAACATCGGCCATCTGGACGTG 

ACGTCGGGGATCGCGAGCTTCATCAAGCTGGTCCTGGCGCTGGAGCACCGCATCCTACCGCCC 

ACGCTCCACTTCCAACTGCCCAACCCGAAGATGGATGTGGTCGATAGCCCCTTCTACATCGTG 

GCTGAGCGCGAACCCTGGCGCGAAGATCTGCTGCCGCGTCGGGCCGGTGTCAGCGCGTTCGGT 

CTGGGTGGCACCAACGTCCACATGATTTTGGAGGAGTTTCAGCGCGAACCGGCGGCGAACAGC 

GCGCGCACGCGCCACCTGACGGTGCTGACGGCGCGGTCGCCGCAAGCCCTGGCGCAGCTGGCG 

GCCAACCTCGCCGAACACCTGCGCGAACACCCCGAGTTGGCGCTGGCCGATGTGGCCCATACG 

CTGCTGCACGGCCGCAAGCCACATCCATTCGCGCGCATCCTGGTGGCGACCGATACGACGGCG 

GCGATCGACGCCTTGATGAACGACC3CGATCCGCGAACGCGTTTCTTCGAAGCGACCGGGCGC 

GGCGAGTCGGTGATCCTGTGTTTTGACGAAACGCCGCCGGAGCCGCGAAGCGCCCGCTACCTC 

TGGGATCACGAGCCGCTTTATCGCGCGGCGGCGACGTCGTGCTTGGCTGGTGAGGTCGCCGAC 

CCGGATCTGGAAGGCTGCTTTACTGCCCTGATCGCCGAGCAGGGCGCGGCAGCCGCCTTTTGC 

CACCAATACGCGCTGGCCGGATGGCTGCTGGCCATGGGGTTGACCCCGTCGGCGTTGATCGGC 
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GTGGGCCAGGGCGAGTGGGTAGCAGCGGCGCTCGCGGAGGTGTTCCCGCCATCGGCCTGCTTG 

CGCTGGATTAGGTTCGGCGAACGGCTCCCGCAGCCGCGCGATCAACGGATTCCGTTTCTCTCC 

AATTTCTCTGGAAACTGGATCGTTGGGCGTGAGTTGGCCGACCCGGATTACCCCAGAAAGCAG 

AAGGGTAAGCGCTGCATGAAGCGCCGTCGGTCCCAACCTCGGTCAGCTGGTGCAGGATGGGGG 

CGATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACG 

GTGATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAG 

TACCTGGGGGCGAGCTCGAGGTAGCGGTCCCGCGGCCAGTAGGGCATCGCGCGAATGACGTCG 

GCCAGGTAGGCCTCCGGGTCGAGCCCGTGCAGCTTGCAGCTCGCCACGAGCGAGAAGAGGTTG 

GCCGCGGCGGAGGCGTGGTCGTCGCTGCCGAAGAAGAGCCAGGACTTTCTCGCAACCGCAATG ■ 

GATCGCAGCGCTCGCTCGCTGGCGTTGTTCTCCAGGCGCAGCCGACCGTCGTCGAGGAAGCGC 

CGCAACGGCTGCTCTTGGTTGAGGGCGTAGCCGAGCGCGGTGGAGACCAGGCCGCGCTCGCGG 

GGACGAGCGTGCTCGGCCCTGGCCCAGGCAAAGAACGCGTCGACCAGAGGGCGGACGACGACA 

TCGCGACGCACCTTGCGCTGCGCGGGCGGCAGGTCCGCCAGCGCGCGATCGGCGGCAAAGAGG 

GCGTTGATGCGCCGCAGCCCCTCGACACCGAGCTCGTGCTTGCAGACCGCCGCCTCCCAGAAG 

TTGGTACGGCAATGCGACCAGCATCCGACTTCGGTCC3GGGGCGGACCGCGCTTCTCGTCGGCA 

GCAGCGCCTCTTGGTGGTGTGCCGCGGAAGAGGGCGTCATAGATGGCGTGAGCGTCAGCTTGA 

ATATACCGAGAGAAGCCGCGGAACATCTCGCAGACCGCGGCGCTGGTATGCTTGGGCTGGTAC 

TCGAAGAAGACGTGATCCTTGTCCGCGAGGACGACGAAGAAGTGTCCCTTGCGGCACGGCCCG 

GGCTTCTTGTCCTTGCGCTCCTGGATGGGCCCAGGCTGGACGGAGACCCCGGTGGCGTCCGTG 

GACAGGCAGAAGGCGGTCTCGAAGGCCTCTTTGCGCGCGGCCTCGACGATGGCGCCCAGGGTC 

GCACCGACGTCTTCGGCGTAGCGGCACATCGTGCCGCGATCGAGCGACGCGCCCTGAAGCTCC 

AGCTGCTGCTCCAGTCGATAGAACGGGACGCCGAGCAGGTACTTGCTGGTGAGGATGTGCGCA 

ATCATCGACGGCGCGAGGAACGACCGCCGGAACAACTCCTTCGGAAGCGGCGTCGTGATGAAG 

ACCGTGCAGGTCTCGCCCTTCGGCGCCGGCGGCGGCGCGTCGAGCGAGGGCGCGTCGAGCGCT 

GTGGAGGAAGCGCTCGGCTCGCCGGCCGCTGCCGTGTCCTCCGGGCTGACGCTCGCAGCGGGC 

GTCGGCGTCGGGGCTTCTCTCGCGACGACCTGGAGCGGGGCCGCTTCCTCCTCGCCCGAACTG 

CTCGCATCCGTGACGGACCGCTCGGCCTTGTACACGACGCGTGCGAGCACGATGCGGCGCATT 

CCGCCGCGCTCGTAGCCGAGTCGCGAGGTCTCCTCGACCCCGATGCGCGTCGCCGTCGCATCG 

AGCTCGGGGCAGGAGAGCTCGATGCGGACGACGGGCAGGTCGGACTCGGACAGGTCGCGACGG 

CCCTTGCCGCCGGACCTTCGTTTCGGCCCCTTGGGGTCGTCGTGCTGCCGCTCGTCGCCTGTA 
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TTGCGCTCGGCGGCGTCGAGTGCCTTCGCGAGGCGCTGGACCTCGAGGAACATCGAGTCGAAC 

GCCAGCTGCTCCGCGCTCACCTCGGCGCGCTCCGCCTTGGCCACGAACAGTCGACGTCGCAGA 

AGCTGCAGCTGCTCGAGCGCACGGGTGTAGGCGCGCCGAAGCTGCGCGAGCGCATCGCGCGCT 

CCCACGAGCTCGCTCTTTGCCGCGGCGAGCTCCGCTTCGAGCTGCGCGATGCGCTGCTGCTCG 

GCCGAGAGCGTCGGCTTGGCGGCGGCGTCGTGCACGACGCCGCTCTACGTAAGCCGCGCGTAC 

TTGTCGAGCGAATTCGTGCGGCTCAGTGGACGCGGCGCGGTGCGCGCCTTCGCGGTTTGGACG 

TGGGCGCGATCTCGATGCCGTCGAGCAGCGTCTCGAGCGTGGCGTCGTCCACCTCGACGTGCG 

TGGCGCCCTCGGTCGGGGGGTCGGGAAGTGCGAACGCTCCGCGATCAAGGCGTTTTGAAAACA 

GGCAGATTCCACTGCCATCGAAGAAGAGAATCTTGATCGTGGTCCGCCGCTTGCCGACGAACG 

CGAACAGCGCTCCGCAGCGAGCCTCGTACCCCACACGCTCACGGATGAGACCCGAAAGCCGCT 

CGAAGCCGTAGCGCATGTCCACCGGCTCCAGCGCGACGAACACCTGCACGCCCGCCGGAATCA 

TCGCCCCGCTCCGCCGAGGGCACGGACCACCTCCGCCAGCAGCGCGGGGTCGAACCCCGCGGC 

GACGCGCACCCGCGCGCCGCCGACCTCGACGACGAGCTCCGCAGCGCTGCTCGTGACGGCGGG 

CGCCTTCGGCACCAGGCGCAGAAAGCGCGGTGGCTCGGCCCGCGACAGCCGGCTCGACCAGCC 

GTGCAGCGTCGAGGCCGCAAATCCGCGGCTCCGAGCGAACTCCTCCGCCGTTTCACCACTCTC 

GCGCCACGCCCGAACGCGCTCGGACCACATCACTTCGGTCGCCTTCGTCCTTGTCATGCACGC 

CATCATGAACTGGACAGCGCAGCCGGGGTGAGACGGCGCTTCGCGCAGCGCTTACGCAGAAGG 

CGCGCCGCGCGCCATTGTCGGATGCGGTGCGCGACTTCGCCGCCGATCGGCTGTTGCTGGAAC 

TGGGACAACCACTGGACGTAACGGCTGAAGCGAGCCAACGGCTCCAGCTCGCGCGGGGCGACC 

TGTTCGGCGCCTACCAAGCGTTGGCCCAGCTCTGGATCTGCGGCGCCCTGGCCGAACCGCCGC 

GACTGTATCCCGACGAACACCGCCGGCGCGTGCCGCTGCCGAGCTACCCCTTCGAGGGAAAGC 

GGTTCTGGATCGAGGGCTCGCCGTTCGAAACCGCGCCCGCCGCCGGCGCCTCACCCCAACCCG 

CCGATTCGGGGGACATTCTCAAGGGCGACCCGGCGGACTGGTACTATCGGCCGCGTTTCGAAG 

CGGCGCCGCTCTTGCCCAGCCCGTTCGAGAGCGAACCCGGCGATTGGCTGGTGTTCGAAGATG 

AGCTGGGGCTCGGCGCCTGGCTGAGCGAGACCTTGCGCGACAAGGGCGCGCGGGTCGCGACAG 

TCGTTCGAGGCACCGAGTTCCGACGCCTGGCGTCACAGCGCTTCCAGCTTCGTCCCGATCGAC 

GGGACGATTACCGGACCCTGCTGCACGAGTTGAAGGCGCAGGGCATCGCGCCGGTCCACCTGT 

GCCACCTATGGAGCGTGACCGCCGCACCGGATGCCGAGCAGTTGCTCGACGTCAGCTTTCACA 

GCCTGGTCCATTTGGCGGCCGCTTTGGGTTCGGTTGGCTACTTCCACGCCATGAAGTTGAACG 

TGGTCGCCAACCGGCTATTCGACCCCGAGTCGCCCGAGCGCACCGAGCCCGCCAAGAGTCTGT 
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TGCTCGCGGTGACCAAAGTCCTGCC3CAAGAGGTGCCCAACGTTCGAACCCGCGCCATCAGCG 

TGGACCTGGATCGCTCGTTCGACGCGGCGGCGCCCGCCTGGGCCGCCAGTTTGTTGGTTGAAT 

GCGGCGCGCCCGTCGAGGAAACGGTGGTGACCTACCATGGCGCAGCCCGATGGCTGCGCCGCT 

TCGATCGCGTTGCGGTGAATGGTCTCGGCCCGTTCCACCCCGATCAACCTGCGCCGCTGCTGC 

GCGAGCGCGGCGTGTACCTGATCACCGGCGGCCTGGGCGGCGTGGCTGGCCAGTTGGCGCGCT 

ACCTGGCGCGGGCCTGCCGGGCGCGGTTGGTGCTCACCGCGCGCCGGCCCCTGCCCGAGCGCG 

ACCAGTGGGATCGGGAGTCGGCCGTGCTGTCATGGGACGACAAGACGCGCCAGCGCATCGAGC 

TGGTGCGCGAGCTGGAGCGGCTGGGGGCCGAAGTATTGGTGGTGGCTGCCGATGTCGCCGACG 

AAGCGGCCATGGCGCAGGCGATCGAGGCCTCACTGGCGCGATTCGACGCTTTGGACGGCTTGA 

TCCACGGCGCCGGGATCGTGCGGGTCGCGTCGGGCCGCACGCCGATCGGGAGTATGACGCGGG 

CCATGTGCGAGGAGCAGCTCCGCCCCAAGATGTTGGGCCTCGACGTCGTCGACCGCCTCCTGC 

GCGATCGCCGGTTGGACTTCCGCATTGCCATCTCGTCGCTCGCCCCGATTCTCGGCGGCCTCG 

GCCACGTCGCCTACGCCGCCGCCAACCTCTACATGGACGCGTTCGCGACGCGCGGCGCCGCCG 

GCAACGCGCCTTGGATCGCGCTGAACCTGGCCGAGTGGGAATACGAGGGCCCGGCTACCTACG 

ACGAGCGGGTGGGCCGTTCGCTCAAGCAGCTCGAGCTCACCAACGAGGAGGGTATCCGCGTCT 

TCCAGACGGTGTTGGCCTTGGCCGCGCGCGGCCCGCTACAGCAGATCATTATTTCCACCGGCG 

ACCTCCAGGCCCGCCTCGACAAATGGATTCACATCAAATCCCTGCATCGCCGACCGGGGCCGG 

TCCAGCTCAGTCGCCGGACCGCGGCACCCCAGGGCGGTTTCGGCTCGGAGCGCGCCGCCTTCG 

AGGCCGCCTTCGCTGACGCCTGGTGCGACTTCTTCGGGGTTGAAGAGGTCGACCCGAACAAAA 

ACTTCTTCGATCTGGGCGCCAGCTC GCTCGACTTCATCCACCTCGTCAGTCGCTTCAGCAAGG 

CCATCGAACAGCATGTACCGCTCGAGGCCCTGCTCGAACACTCCACCCTGCACGACCTCGCCG 

CCCACCTCGCGGGCGACGCGAACACCGACGCCAGCGACGAAGCGCGCATTCGCCAACGGCTGC 

AAGGCGCCAAGTCCGGCGACATCGCCATCATCGGCATGGCCGGCCGCTTCCCGCTCGCGCCCG 

ACCTGGACACCTATTGGCGCAACCTGGTCGGAGGCATCGACGCGGTCAGCTTCTTCAGCGCCG 

AGGAGTTGCGTGCTGCTGGCGTCACCGCGGCCGAGATCCACCACACCAACTACGTGCCGGCCA 

AGGGGCGCTGCGCCGACCAGGACTTC-TTCGATGCGGCCTTCTTCGAATACACTGCCAGCGACG 

CCGAGCTGATGGACCCGCAAAATCC-CGTGTTACACGAGGTCGTGTGGCACGCGCTGGAAGACG 

CCTGTTTCGACTTCAACGGCGATCACGGCCAGGTCGGCCTGTTCGCGGGCGCCTCGCCGAACC 

TGTGGTGGCAGTTCGTGGCCAGCTTTTCCGAGGCCGCCAAGACGCAGGGCATGTTCACCACCA 

CCCTGCTCAACGACAAGGACTCGA-CGCGACCCAGATTTCATACAAGCTCGGTCTAAAGGGCC 
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CCGCGGTCACCTTGTTCACCGGCTGTTCCACCTCGCTGGTAGCCGTTGACGCCGCCTGCCGCT 

CGATCTGGTCCGGTCAATCGGACA7GGCCGTGGCCGGCGCGGTCTCGCTGACTCTCCCCGATA 

AGGCCGGCTACATCTACGAAAAGGG-ATGCTCTTCTCGGCCGACGGCCATTGCCGGGCTTTCG 

ACGCCAACGCCACCGGCATGGTCTTCGGCGACGGCGCCGGCGCGATCGTGCTCAAGCCGTTGG 

ACGCGGCCCTGCGCGACGGCGACCCGATCCATGCGGTGATCAAGGGCTGCGCCACCAACAACG 

ACGGCGACCGCAAAGCCGGCTACACGAGCGTCAGCGCCCAAGGCCAGGCCGAGGTGATCCGCT 

CGGCC CAGATCCTGGCCGACGTGGC 3 CCCGAATCCATCAGCTACGTGGAAGCCCACGGTAQCG 

GCACCAAGTTGGGCGACTCGATCGAGATCAAGGCGTTGAAGCAAGCCTTCGCCAGCGACAAGA 

ACGGATTTTGCGGCATCGGGTCGGTCAAGACCAACCTCGGTCACCTGATGGCGGCGGCGGGGA 

TGGCCGGCCTGATCAAGACGGTTC73GCGATGAAGCACCGCCAATTGCCGCCATCGCTGCACT 

GCGACGAAGTGAACCCCGACCTGGAC-TTGGAGCGCAGTCCGTTCTACATCAACACCCGCCTGC 

GCGACTGGGTTGCACCGGGCGGGCC3CTGCGGGCCGGCGTGAGTTCGTTCGGGATCGGCGGAA 

CCAACGCTCACGTCATCCTGGAGGAGCCGCCGACGCGCGAGAGCGGCACGCGCATGCGCCACT 

GGAAATTATTGATGCTGTCGGCGGCCAGCGAGGCGGCGCTCGACCGCCAGGCCGATAACCTGG 

CCGACTACCTGGAGCGCCATCCCGAGGCCCACCTCAGCGACGTGGCCTATTCCCTCCAGACCG 

GCCGGCGCGTTCTGGCCTGGCGG CGC ACGGTCCTATGCGAGTACCGCGAGGACGCGGTGACCA 

GTCTGCGCGAGCGACAGGCCAAGCC-CGTCCAGACAAGTCGCGTCCGCTGGGACCACAAGGACG 

TGGTCTTCATGTTTCCCGGTCAGGGCGCCCAGTACCTCAACATGGGCCGCGACTTATACGTCA 

TGGAGCCGGTCTTCCGCGAGGTCAT3GACCGCTGCTTCGAGTTGCTGGCCCCTTTGTGGTCCG 

AGCATCCGCGCCAGATCCTTTATC33GAGGGCGGGGTGTCGACCCTGCTCCACCGGACTGATT 

ACACCCAGCCGATCGTGTTCTGCTT 3GAGTACGCCCTCGCCCATTTGCTGCTCTCCTGGGGAT 

TGAA3CCGGCCGCGACCATCGGCTACAGCTTCGGCGAGTACGTTTCTGCCTGCCTCGCCGGCG 
TCTTCTCCCTGGAAGATGCGATCCGrCTGGTGACCGAGCGCGGTCGGCTGATGGCGGCTTTGC 
CCGCGGGCGCCATGCTCAGCGTCCC3GTTCCCGAATGCGAGCTGCTGCGGCTGCTGGACGGCT 
TCCACGCCCAATCGGCGGCCCATCrGGCGCTGGCCGTCGACAATGGCGCCTCCTGCATTGTGG 
CCGGCGAGCAGGCCGCCATCTCGGCCTTCGAATCGATGCTTCGCAAGAAGCGTCTGTTGACCA 
TGCGSGTCGCGGTCAGCCACGCCGCrCATTCGCAGGTCATGACCGGCGCGACCGACGCCCTGC 
GCAG3ATCCTGCGGAAGATCCCCCT3TCCGCGCCGACAATTCCCTTCATTTCCTGCGTCACCG 
GCACCTGGATCACTGCACAGCAGGC7ACGGATCGCGAGTATTGGGTGAACCACATGTGCGGGA 
CGG73CGGTTCGCGGCGGGTCTGA"GAGCTGGGTCAAAACCGCGAGGCGGTGTTCCTGGAAG 
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TAGGTCCGGGCCGCGACTTGACGTTGCTGGCCCACCGCATCCTGGCCGACAGCGCGGCCGTGT 

TCGAGCTGGTCAAGGCGCCCGACGGCGGCGACGACGATGGGTTCCTCCTGCTGGATCGATTGG 

CCAAGCTCTGGAGGCTGGGGATTTCGATTGACTGGGCCGGCTTCTACGCGGATGAGCGGCGGC 

GGAAACTCTCGCTGCCGGGATATCCGTTCGAGCGGCGGCGCTTCTGGATCGAGGGCAACCCGC 

TGGAGATCGCCGCCGGCAGGCCCAATGTCCAGGGGCCGCTGGTCAAGGCGTCGGACATCGGCG 

CTTGGTTCTACGTGCCGCAATGGCGGCGGTCGGTGCTCGCCGAGCCGGGTACAACGGCGGCGG 

GCGCCGCCGTCACGGCGGAGCAGGCACGCGTCGTGACCGAGCTACGGGCGGGATGCGCGTCGG 

CCGGCTTGGGCAGCGGGGCCTGCGGACTGAATGGCGGTGCCCCGTCCGAGCGTCCGAAAGAAA 

GTGTAGCGCCAGCCGGGTCGACCAGCGCAGCGGCGCAGACCGGCGCGGACTGCCCGACACCGA 

CTGGGGAGCCAGCGGCTGTGCCAAAGGACGGGGCCGAGCCGCGGCCGACCTGGCTTATTTTCG 

CCGACGCCGGCGGATTGGCCGAATCTTTCGCCAAGCGGGTTCAGGCCCGCGGCGAGAAGCTTT 

ACCTGGTGGCTTCCGGCTCGCGCTTCGAGCGCCTGGCCGAGACCCGCTTCCGCCTCGATCCCG 

GGGCCAAGTCCGATCACCGCCTGCTTTTCAAGGCGCTCGACGAGGCCGACATCCTGCCGACCC 

ACCTCCTCGACTTCCGCTCGCTTGACTGCGGCGGGCCCGACGCCGACCCCATGGACCAGGCCG 

GCTTCTTCGGGCTGTTGCACCTGGTCCAGGCGATGGCAGAGGCCGGCTACAGCCATCCCATTC 

GGCTGCTGATCGTCAGTTGCGGCGTCTACGATGTCACCGGTGCCGAACCGCTGCAGCCGGCGC 

GGGCCACGATGATCGGACCGGCTCTGTGCATCCCGCAACAGTATCCGCACCTCGAAACGAGCC 

ATGTGGATTTGGGCGTGGTCCATGCCGACGAGCTCCACGCCGCGCGCCAGCTCGACAGCCTAC 

TTGCCGAATGCCTAAGTGCAACGGCCGAGCGCCAATTGGCGCTGCGCGGCCGACACCGCTGGC 

TGCTGGACTACGAGCCAGTCCGCTTGCCGCCGCTCGACCCGGGCCGTCTGCCCTGGCGCCAGC 

GCGGGGTCTACTTGATCACCGGCGGTTTGGGCGGGATCGGCCGCATCCTGGCCGAACACCTGG 

CCCGCACGACCTCGGCTCGCCTGGTCCTAATCGGCCGCGAAACCCTGCCCGACCGCGACGACT 

GGGACGCCTGGCTGAACCGCCCGCAACCGGTCGACGCCACCCACGAACGGCTGCTGCACAAGA 

TCCGCGCGATTCGCGATCTGGAAGCGCTAGGCGCCGAAGTCCTGGTCCTCGCCGCCGACGTCG 

CCAACGAAGCCGCCATGCGCGAGGCCTACGATCGCGCCGAATCCCACTTCGGCACAATCCACG 

GGGTGATTCACGGCGCCGGCCTGATGGACGCGCAAAGCTTCTCACTGATCGACGCCCTCGACC 

ACGACCTCTGCGCCCGCCAGTTCGAAGCAAAAATCCGCGGCGTCTGCGTGCTCGACCGCGTTC 

TGGCCGACCGCACGCTCGACTTCTGTCTGCTGATGTCTTCCATCTCCACCGTGCTCGGCGGCC 

TGGGCTATTTCGGTTACGCCGCGGCCAACGCCTTCCTCGACGCCTTCGCCCAGGCGCGCAGCC 

GCGACGCCGCTTTCCCCTGGCTTAGCGTGGCCTGGAGCGATTGGAAGTACTGGACCGAGCGCA 
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AGATGGACAACGAGGTCGGCGCCGTCATCGACAGCCTCTCGATGGAACCCGCCGAGGGCrrCG 

AAGCCGTCACCCGCGTCTTGGCTTGGGGCAAGGCGCCCCACATCGCCAACTCGCCCGGTGACC 

TCGGTCGCCGCCGGGATCAATGGGTCAAACTGGCCAGCCTGAAATCGGCGCACTCCAGCGAGC 

CCGAGCCGGCTAGGCATGGACGTCCGGCGCTCTCCAGCGAATGGGTCGCGCCGCGCAACGTGG 

TCGAAGAGAAGCTGGTCGCCATTTTCGAGCAGGTGTTCGGCACTGCGGCACTGGGCATCGAGG 

ACAACTTCTTTGAGTTGCGCGGCGACTCGCTCAAGGCGGTCATGACCGCGGCCCGTATTCAAA 

AGGAGCTGAACGTGGAAGTGCCGCTGCCGACCTTCTTCCAGATGCCCACGGTCGCTGGCCTGG 

CCCAGTTCGTGACGCAAGCCAAGCGCAGCGGCCGGGAGACGATTCGGCGCACCGCGCCGCGCC 

CACATTACCCGCTCTCGGCTGCCCAGGGCCGCCATTACCTGCACTACCGCATGGACCCGCGTT 

GTACCGCATACAACGATCCCTTCGCCAACCTGATCGAGGGTCCGCTGGACGTGGATCGCGTGG 

AGCGCATCCTGCACACCCTCATCCTACGCCACGACTGCTTCCGCACCTCGTTCCACTTCCGCG 

AGGGCGAGCCGGTCCAGGTGATTCACGATCGGGTGGACTTCAACCTGGCGCGGATTACCTGCG 

CGCCCGAGGATTTGCCCGAACGGATGCGCGATTTCATCCGCTCCTTCGATCTGGAGCGACCGC 

CCGCCATGCGCGCCGGCCTCTTCGTCACGGGGCCCGAGCGCCACGTGCTGCTAATCGATTTTC 

ACCACATTATCACCGATGGCGTGTCGTTCGAGAACTTCGTCGGCGAGTTCGCGGCGCTCTACC 

GCGGCGAGATCCTGCCCGAGCTGGAACTCGAGTACAAGGATTTCGCGGTGTGGCAGCATGAGA 

ACCGGGGCCGCCGCGCCAACAGCGACCAGGCCCGCTACTGGACCGAGCAGTTGGCCAATGCGC 

CCGGGCCGATCGAGCTAACCACCGATTTCCCCCGTCCCAGTCGACGCAGCTTCCGCGGCGACC 

GCGTGCGGACCGTGCTTGATGCGGAGCTCGTTGCTCGACTCAAAGAGCACGCGGCGCGCCTCG 

GCATCACCCTCTATAGCCTGCTGCTGGGCGGATTCTCGTTATTGCAGCACAAGCTCTCCGACT 

CGCACGACATCGTCATCGGTTCGCCCGTCGCGGGCCGCACCCGGAGCGAACTCCAGGATCTGC 

TGGGCGCGTTCGTCAACACCCTGCCGATGCGCCACCGCATCGACCCGACCCATACCGCACGGG 

TCTTCTTGGAGCAGGTCCACCAGACAACCTTGGCGGCCCTCAGCTACCAGGAGCACCCTTTTG 

ACGAAATGGTGGCGACGCTCGGGTTCGCCGCCGATCCGGCTCGCAACCCGATCTTCGACACGA 

TGTTCTTGCTGCAGAACATGGCCATGGGTGCAACCACCATTCCCGGTCTGCGGCTCTCGCCTC 

ACGACACTTTTCACCGCAAGGCATTGTGCGACCTGATGCTACAGGCGACCGAGTATGACTGCC 

ACCTGGAGCTGGTGCTCGAGTTCGCCACCGACCTGTTCCGGCTGGAAACCGCGCAAGTCTTGC 

TCGACCGCTACCGCCAAGTCTTGGAGTGGCTGTTGGCGTACCCCCATGAATCGATAGACGATT 

TGACGCTCGCCGGCCACTTTCGCGAAGTCGAAGTGACGATGTCGGACGAGGGCGACTTTGATT 

TCTCAGATTTCGAACCCCGCAACGTGAGAAACCTATGGCGCGCCTGAGCCGCACAGATCTCCA 
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ACTCGCCATTCACCAGCGCACCGTC-3AGCGCGAATATTGGCGCGCTCTGTTCGAGCGCCATCC 

GCAACGGTCCAGTTTGCCGGGGGTGCTCACCGCCCCGATCGGCGACGAGTCGACCCGCGAGAC 

CTTGTCATTGGTCCTCGACGAAGATCCCCTTCGGCTGAGTAATCGTTCGCCGCAACGCCTGCT 

CACGGTGTTGGCGGCTGGCCTCGCGGCTTTCCTCCACCGCTGCGACGGCGCTGAGCGCTTCAC 

CCTGGGGTTGGCCCTACCGCGCCAAGCCGATGACCATCACCCGATCCTCAACAGCTTGATCGC 

GCTGGGGGTCGCGGTCGACTCGAGTACGACCTTCCGCGATCTGCTCTATGCGCTTCGATCCGA 

ATACCACGAGGCGATGCGCCACGCCAACTTTCCGCTGGCGACCTGGTGGCGCGGCCTACCCGG 

CGGAACGGCGCCGTTCGACGTCGCCCTCAGCCTGGACCCCTTCACAGACGGCGATTCGCTGGA 

AGACCACGCGATCGGCGCGTTGTTCCGGTTCGCATTGGAGGGTGAGCGCCTCACCTGCCGATT 

GCGATTCGACCCTGCGCGCTATGACCGTCCCGCGATCGAAAACCTCGCCGATCGTTTCGCCCG 

CTTCCTCACGCGCCTGTGCCGGGAC3CCTCCACCGTCATCCAGGCGCTGGACCTTTCGCTGCC 

AAGCGATGAATCGGTGTGGCGCGTCACTGAAGGCGTGCGGCGCGGCTATTCGCAAGACCTGAC 

GCTAGACCGCGCGTTCCGCCGCCAGC-CCGCGCAAACGCCCGATCAGCCGGCGATGACGTTGAA 

CGGGGACGTCCAGAGCTACGCCGAGGTCGACCGCCGCAGCGACGCGCTGGCCCGCCACCTCCG 

TCGCCACGGCGTCGGTCCGGAAACGATTGTGGCCGTCAACGCCCGGCGCGGGCCTAATCAGCT 

GACGGCCCTGCTCGCGGTCCATAAG3CCGGCGGCGCCTACCTGCCGATCGATGCCGAGGAGCC 

GGCTGCCCGCCAGCAATTCAAGGTGCGCGACAGCGGGGCGCGGTTGGCACTGGAGCCGTCGCC 

GGACCAGGCGCTGACCGTCACCGACCTGCCGCGGCTCTTCCTGGACGATGCCTCGCTCTTCGC 

TGACGGCGGGCTCGATGTGCCGCGCGGCGCCGACTCGCTCAATCCGGCCTATGTGATGTACAC 

GTCCGGCTCGACCGGACAGCCCAAG3GTGTGGTGGTTCCCCACCGCGGCGTGGTCAATCGTTT 

GAATTGGGGGCAGTCCCGTTTCCC3CTGGACGAACGCGACCGAATCCTCCAAAAGACGCCGCT 

GCTGTTCGACGTGTCGGTCTACGAGCTGTTCTGGGGCGCATGGAGCGGGGCCACCCTGGACAT 

CCTCGAGCCCGGCGCCGAGCGCGACCCCGACGCAGTGGCCAGGGCCCTGGCCGAGCGCGCCAT 

TACCGTATGCCATTTCGTGCCTTCGATGCTGCTCGTCTACTTGGAAGTCATGCGGCGGCACCA 

TGCGCCGCCCGTGCCGGACCGGCTCCGTTACGTCTTCGTCAGTGGCGAGGCCCTCGAACCGGA 

CCACCTCGCCGGGCTCCAGCAGATT3GTCGGCGCCTCGGCCGCACGATTCCCCTCGTTAATCT 

GTATGGACCAACCGAGGCCTCGATC3AAGTCTCCTGCTTCGCCTGTCCCGCCGACCATGTGCC 

GCGCCGGATCCCCATCGGGCAGCCGATCGACAACGTCGCACTGCACGTTCTCGACCGGCGCGG 

CCGTCGCCAGCCGCCCTATCTTCCT3GCGAGCTGTTCCTGGCCGGCGACTGCCTGGCGCGCGG 

CTACCTCAACCGTCCCGACCTGAC33CGCTCCACTTCGTGCCCAATCCCTTCGGCAACGGCGA 
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GCGCATGTACCACAGCGGCGACTTGGCGCTCGTGCGCGGCGACGGCCAAGTGGCGTTTCTCGG 
CCGCCGTGACCACCAAATCAAAATCCGTGGTCAACGGGTCGAACTGGGCGAAATCGAGAGTCA 
TTTGCGCGGGCTCGAAGGCATCGCCGCCGCCGTCGTCCAGGCCGAGTCGCAGCACCATGAAAC 
CCTGCTGCACGCCTACGTCGTCACCAACGACGCGGGCCTCAATGCGGCCCGGCTGCGCGCCGC 
CCTCGCTCAACATCTGCCCGAGTACATGATTCCCCAGCGCTTCTCGCGGCTGGCCGAGTTGCC 
GCTGCTGGCGGCAGGCAAGATCGACCGCGCCGCCCTCGCGCAACGTGCAACGCCGCTCGCCAG 
CGGCGCGCCCTTCGTGGAACCCAGCGGGCCCACCCAGCAGCGTATCGCAGAACTGTGGCGCCA 
GGTCTTAGCGGTCGCCGAAGTCGGCGCCGAGGATCCCTTCTTCAGCATCGGCGGCAACTCGCT 
CAATGTGCTCAAGCTCAGCGCCGCC-CTGAGCGACGCCTTCGCGCGTGACATTCCCATGCCGGC 
CCTGTTCCAATACGACACCATCGCC3CCCAGGCCTCCTGGCTCGACGGGCAGGTTGACGAACG 
GGCCCAATCCGCCGCGCTCGACCGGCAGGCCGCCGAGGCGGCGCTGACCCTTCAAGAGACCGT 
GGCC ATTTTTGAGGGATT CGATGAC GAACCATGACCATCACGAGGAGAGCAGCGGCCTGGAGA 

TCGCCGTCATCAGCATGGCCTGCCGATTCCCGGGTGCTGCCGATTGCGACGCATTCTGGGAAA 

ACCTGATCAACGGGACCTCCTCGATCACCCATTTCAGCGACGACGAGCTGATCGCGGCCGGCG 

TTGACGCGCGCGACCTGACGCCGCAGTACGTGCGCGCGGCCGGCCAGATCGATGACGCCGAAC 

GGTTCGACGCGGCCTTCTTTGGGTACTCCCAGCGTGAGGCCGAGCTGATGGACCCCCAGTTCC 

GCCTGCTCCATGAATGCGCCTGGTCCTGTCTGGAACAGGCCGGCATCGATCCGCGCGTCGAAG 

CCGCGCCGATCGGGCTGTATGCCGGCGCAGCCGACAACACCTACTGGAACGCGCTCTCGTCGC 

TCGACCGGGGCTCGGCCGAATCGGAGCAATTCGCCGCCGAACAACTTTGCAACCGCGATTTTC 

TGTGCACGCTGGTCGCCGCCGCGCTCAACCTGAAAGGCCCCGCGGXGGTGGTTCAAAGCGCCT 

GTTCGACCTCGCTGTTGGCGGTCCACTCGGCCTGTCGTGCGCTCCTGACCGGCGAATGCCGAG 

TGGCCTTGGCCGGTGGGGTGGCGCTGCGCTTCCCACGCCCGAGCGGTTATCGCTACGAACCTG 

GCATGATCTTCTCGCCCGACGGGGTGTGCCGGCCGTTCGACGCGGGCGCTAACGGGACGGTGC 

CCGGCGAAGGCGCGGGGCTGGTAGCGTTGAAGACGCTGAAACGTGCCCTCCAGGACGGCGACA 

CGATCCACGCCGTGATTCGCGCGACCGCGGCAAACAACGATGGTGCCCGCAAGACCGGGTTCA 

CCGCGCCCAGCGCCCACGGCCAAGCCGAAGTCATTCGCACGGCGCTGCGCCTGGCCCGGGTGC 

CGGCCGAATCGATCGACTACGTCGAGGCCCACGGAACCGGCACGCCGCTAGGCGACCCGATCG 

AGGTAGCCGGCTTGGTGGAGGCCTTCGCCAGCGAGAAGCGCGGCTATTGCCGGCTGGGCTCGG 

TCAAATCCAACCTTGGTCATCTGGACACTGCTGCCGGCATCGCCGGCCTGATCAAGACCGTGC 

TGG -GCTCGAGCACGCGC ACATC CCCAAGTCCTGCCACGTCGCCACGCCCAACCCCGCGGCGC 
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GCCTACACAAGACGCCTTTCCGCATTGCCGCCGACGGGATGGCCTGGCCGCGGCGTATGGCGA 

CGCCGCGGCGGGCGGCGGTGAGTTCGTTCGGCATCGGCGGCACCAACGTCCACGCGATTTTGG 

AGGAGGCGCCGCCCCGCGCGCCCGAGCTGGCGGACGGGCGCAGTCAGGTGTTCGTCTTCTCCG 

CCAAGGACGAGGCGGCGCTGGACCGTGCCCTTGCCAACTATGGTGCGGCCTTGGAGAAGCGCG 

GCGACCTCGCGGCGGGCGCGGTGGCCTGGACGCTCCAAAACGGCCGGGCCGCATTCGAATGGC 

GAGCCAGCGCGGTGGCATCCGACCTCGACGAATTGGCGGGCGCATTGCGCGGCGAGCGGCCCG 

GCGCCGTCAAGAAAAACCGAATGGCGCGCGAGGATAAGCCGGTGGCGTTCTTATGTTCGGGGC 

AGGGGAGCCAGTACCGTGGCATGGGCCACGACCTGTACCGCGAAGAGCCGCGTTTCCGGCACC 

ACCTCGACGCCTGCCTCGCCATCCTCGCCGAACACAAGCCCGAGATCGACTGGCTGGCGTTGC 

TGGGCTACCGCGACGAGGACGAGCCAACCGACCAGATCGGGACGTCCTCGCAGGGCCCGAGCC 

GGTCAGCCGCATCGAACCCAGCGGAGCTCCTCGACAGCACCGAATTCGCCCAACCTTTGCTTT 

TCTCCATGTCCTACGCGCTCGGTCGGCTGTGGCTCGACTGGGGCGTGCGACCCACGGCGATGA 

TCGGGCACAGCCTGGGCGAGTACAGTGCTGCATGTATTGCAGATTTCTATGCACTCGATCAGG 

TGCTGCCCTTCATTCTGACCCGCGGTCGAGTCATGGCGCAATTGCGGCGCGGCTCGATGTTGG 

CCGTCAGCGGTGACAGCGTTCTGATGCGCGAGCTGATCGCCGATGCGCTCGATTTGGCGGCGA 

TCAACGGCGCTGACCAATTTGTCTGGAGCGGGCCGAGCGAGGCTGTCCAAGCCGCGGGGGTCC 

GACTGCGCGGCGCCGGCCTGCGTGCCACCGAGCTGAACACCTCACACGCGTTCCATTCAGCCA 

TGATGGATCCCATTCTGGAGGAGCTAACGGTTGCCGGTTCGCGACTTCAGGTCGGTGTCGGGA 

CGATTCCGGTCGTTTCATGCGTTACCGGAACCTGGTTGACGGCGAAGCAGCTGGCCGATCCGC 

GCTACCACGCGCGTCACGCGCGCGAACCGGTGCGGTTCGCGGCGGGCCTAGCGACGCTGACAG 

GGGAGGAGCCGCCGCTGATGCTCGAAGTGGGGCCGGGCTCGACCCTGGCGGCTTTGGCCCGCG 

AGCATTCGAATGCCCGCCTCCCGGTCGTCACCAGCCTGCGCCACGCTCGCCAGGCGACGCCCG 

ATCGCCAATACCTGCTCGAAACGCTCGGCTGCCTTTGGCGACACGGGGTTTCCGTCGATTGGG 

GGGCCCATGCCGGACGTTCGCGACGCTTGGTTTCGCTGCCCGGCTATCCCTTTTCCGGCGCGG 

TGCGCCGCTTAGCCGGCGACCCCCTCCGCCTGCTGGCCGGAGCCCGCGCCGTCGCCGCCCCGT 

CGGGAACGCGCCAACTCAGCGCCGACGCGCGCGACCTCCCGAACACTCCGGAGCCGACATCCG 

GCGCCGTGTCGGCGATCAAAGCGCCAATCGCCGCCGCCGATCCCGGCCTCTATCGCCTCTCCT 

GGCGCCAGGCCGGAACGGCGCCGCTCGGTCCGCCCGATCTCGGTCCGCCCCGCGACTGGATCG 

TCTTCGCCTCTGATTCTCACCTGCTCCAGGCGCTCAGGGCCAATCTCGGGACGCGCGCTCAGC 

GGGTGACGCTGGTGACGCCGGGCCAGGAGTACGCAGCCGAGCCGTCCGGGTTTCGGCTGCGGC 
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CGGACCAGATCGACGATTACCGCGCCCTGTGGGCGGACTTGGCGCAAACCGGTATTGTGCCAC 

GATACATCGCGTTCCTCGCCCCGTTCATGTACCGGGCGCGCATGGCGGGCGATGCCTCGACCC 

TGGACGAAGTGCGCGAGGGCGGCTTCCTGCCCCTGACCCGCTTGATCCAGACTCGCCCGCCAG 

GCGGACCGAGCGGACTTCTAAGCCTCACGATCGTCACCCCGGCCGCCCTGGCGCTGGGCGACG 

AAGCGACGCGCCCGGAATGGGCAATCCTGCACGGGATGGTCGCCGGCTTAAGCCGCGATTATC 

CCGAATGGCGCTTCGTCTCGATCGACGGCGGCGACCCATCCCCGCATCGGTGCGAAGGTCTGG 

CCCGCTTGATCGCGCTTCATGCGGTCGACGAGGCTGGCCCGACCCGCTTGGCGCTGCGCGGCC 

TTCACGCTTGGGTTCCACAGTGCGAGCACGTTCAGCCGGCCACCATCCCTGGGGCGGGTATGT 

GGCGCGAGGGTGGTGTGTACATGATAACGGGCGGATTCGGCGGGATCGGTCTGGCGCTGGCCC 

GCGCCCTGGCTCGAGAAGCTCGCGCCAAGCTGATCCTGGTCGGCCGAAACCTGCCCACCGCGC 

CGATCGATCTCGAGGCTTGGGACGC3CCGCCGTTGATTCTCACCGCCGACGTCGCCGACGAAG 

AGGCCATGCGCCGCGTCTTCGATGCCGCGCACGCCCGGTTCGGCGCCATCGACGGCATTCTTC 

ACGCGGCCGGTGTCCCCGGTGGCAGCCTGTTCGCCAACCAATCGGACGCGGCCTTCGAAGACG 

TGCTGCACGCCAAGGTTCGCGGTACCCTCGTGCTGCAAGGCCTGAGGGCAATCGATGCGCCGC 

TGTTGCTGATGTCCTCGCTGGACGCCTGGCTTCCCGGTCCCGGTCAGACCGCCTATGCCGCCG 

CCAACGCCTTCCTCGACGCCTTCGCCAGTCTGCGCCGGCGAGAGGGAGAGCCGGTGTACAGCG 

TTGGCTGGGACAGTTGGTGCGAGGTGGGCATGGCTGCTCGGGTCGCTGCCCGATCGGCCGACG 

AACGCGGCCGCCTGGCGCGCGAGGGGATCAGCCCTCGCCAGGGTTGGCAGGCTTTGAGCCGGG 

CGCTCGCCCTCGACCCCCCCCACCrGATGATCTCGCGCACCGACCTGACCTCGCGCTGGCACA 

GTCGATCCAGCCCTACGCCGGTCGCCTCGAGCGAACCCGAGGTGGCGCTGCCGCGCTGGACCG 

CATCCGCCTGCCAAGCCGTCATCGAGCGTGTTTGGTGCGAGCACTTCGCCACCGCCGCCGTGC 

CTCCCGATGGCAACTTTTTCGAGCTCGGCGCCAGTTCCTTCGACATCGTCCAGCTCAGCGCTC 

GACTTCAACAACAGTTCGGCCGAGATGTCAGCCACACCGTGCTCTACAGTCATCCCACCGTCG 

CCTTGCTGGCCGGCTACTTCGCCAATGACCCGACGCCGTCCGGTGCTGCTGCCGACGAACGCG 

ACGAAGCGGTGCGTCGCGGCCGCGACCTCTTGAAGAGCCGCCGGCGAGGAGTATGACCGTGGA 

GCACGAAACCGGATTCGAAATCGCCGTCATCGGGCTGGCTTGCCGCGTTCCCGGCGCTGCCGA 

CGTGGCCGCCTTCTGGCGCAACCTGGTCGAGGCCAAGGAGAGCGTGCGCTTCTTCGAGGACCA 

CGAGCTGCGGGCCGCCGGCGTGCCCGAGGAGATCTTGCGCCTGCCCAACTACGTGAAGGCCAA 

GCCACTGCTCGCTGATGGCGAAGCTTTCGACGCGGACTTCTTCGGGTTCCATCCGCGCGAGGC 

CGCCTACCTGGACCCGCAAGTTCGGCTCCTGCACGAATGTTGTTGGACCGCGCTGGAGGATGC 
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CGGCTACGATCCCGCGCAGTACGCCTACCCGATCGGGTTGTTCGCGGGCGTCTCCAGCAATCT 

CTCGTTCCTGTTCGACCGCATCGAT2CGCGCGACTCCCCCCTGCAGAAGCGCTATGTGGCCGA 

GCTGAACGCGGCCTCCTTCGCCACCCAGATCGCCTACCGGCTCGATCTGAAGGGGCCGGCCAT 

TTCGATTCAAACCGCCTGTTCGACGTCACTGGTGGCGATTCACCTGGCGGCGCAAAGCCTGAT 

CGGCGGCGAGTGCCACATGGCCTTGGCCGGCGGAGCGACCTTGGAGGTCCCCAAAAAGCCCGG 

CTATCTCTACCGCGAAGGCTACATCAACTCGCCGGACGGCCACTGCCGGGCCTTCGACGCCGA 

CGCGGCCGGCACCATCTTCGGCGACGGCGTCGGCATCGTCCTGCTCAAACGCTACCGCGACGC 

CCTACGCGACGGCGATCACGTGTAG3CAGTGATCAAAGGCTCGGCGATCAACAGTGACGGCCA 

TCGCAAGGTGTCCTACACGGCGCCGS3CAAGAGCGGTCAAGTGGCGGTGATCCGCGCTGCGCT 

GGCGGCGGCCCAGGTAGAGCCGCAAACCATTCGCTTCGTCGAGGCCCACGGGACCGGCACACT 

CGCCGGCGATCCGATCGAGGTAGAG3CGTTGACGGAGGTCTTTGCCGAAGCGGGTCGCGGTAC 

CTGCGCCCTGGGTTCGGTGAAGACCAACATCGGCCACTTGGATGTGGCGGCGGGCGTGGCCGG 

TTTCATCAAGGCGGTCTTGGCGCTCGAGCGGCGCGTCCTCCCGCCCAGCCTTCACTTCGTCCG 

GCCCAACCCGGCCATCGATTTCAACGGGCCCTTCTACGTTTGTCGCCAAATCGAGCGGTTGAC 

GGAGAACGGGCGGTTGCGGGCCGGGGTGAGTTCCTTTGGCATTGGCGGCACCAATGCCCACGT 

GATTCTGGAGGAAGCGCCGGCGCCGGAGGCGAGACTGCCGGCCGGGAGCCCGCCAGGCGCGAG 

TCCGTTCCTGTTCCCGCTATCGGCCAAGACGCCGGATGCGCTGGCAGGCCGTTGCCACGACCT 

TGCCGACCACCTGCGGGCGCACCCC3AGCTCCTCCTGGCCGATGTGGCCCTCACTCTGCAGAT 

GGGGCGGGCGTCGTTCGCCTACCGCCATGTGGTCCAGGCTGCGACGGCGGAGGAGCTGATTCG 

CGGTCTGGGAGCGTTCCGACAGGAGTCCATCCGCAAGAGGCGGAATCGAGTACAATGGGTGTT 

GGCAGGCGAGGCGATGTCGCTTGAC3CCGGTTTGCGGCTGTACGCCGATTGGCCGGTCTATCG 

GGAGCGGGTCGACGTCTGTCTGGCGATCGTCGCCAAGCTGCGCCAAATCGACGGCCGGTCATT 

CCTACATGAGTGGATCGAGCGACCGCGCGAGGTTCCTGCCGAATGGTCGACGGCGCTGGCGTT 

CATGTTCCACTGCGCGCTGGCGCAA3CCCTGAGCCAGGCCGGCCTGCACCCGCAGCGCATGTG 

GAGCCGTGGGCTGGGCGGACAGGTCGGCGTGGTTTTGGCCGAATCCCTGTCGTTGGAACAAGC 

GCTGGCGCTGGTGTTGTGCCAGACACCGGTTCCCGGCGATGCCACACCTCAGCGCGAACGCTT 

GGTTCGGACACTGGAAGGCTGCCG3TTTCGTCCACCACGATTTTTGATTTCGGCAGACAGCTC 

GGGTCGACCCCTGGACCTCGCCGAATTCGCTCATGTCGATTTTTGGTGCGGTGGCCAAAGCGC 

CTCGCCCAATGAGGCGGAGCTGCGC7CATGGAGCGACGCCGCGCCCGAGCTGGTGACCTTGGC 

GATCGGCCCATCCTTTCTCGAGGC C GCCTCCGGGACGGTGGGTCTGGCGATCGACCCCAAGCG 
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ACCGATGACCTGTGTTCAGCGCACGGTGGCCGCGTTGTGGGAATGGGGATGTGACGTGCGCTG 

GGCTGCGTTCACCTCGTCGACCGGGCGTCGGGTTCCCCTGCCTACCTATCCCTTCGTGCGGGT 

AATTCCCACGATCGGCGACCCCCTTCGCGGAGCAGGCGCGGAGGATGACTTGATTGCGGCGAG 

CGCTTCCGCGTCGGCCGGATCGCCGCCCGAGCCGTCGGCAAACTCGGCAGCGGAACGCCCACG 

CGCCCAGTCAAGCATCGCCTCGGCAACCACACCGGCTCCGTCTCATACGTCGGCCAGCGTGGC 

CGTGGCCACCATTCTCGAAACCGTCCGTGCCTATTTCGGGTTCGCCGCCGTGCGTTCCACCGA 

CGCCTTCTTCGAATTGGGCGCGTCCTCGCTGGATTTGGTCAACCTGGGCCAGCTCCTTTCCGA 

TCGTCTCGGCCGCGAGGTTCCGACCCTGCTCCTCTACGACCACCCAACACCGGACCAGTTGGC 

GCTGGCCCTGACATCCGCGGCGCTCAGCGCAGAGGCGCCGCCCTTAAGGGGCGGTCATCGCGC 

ATCGACTTCCGGCACAGCCGCGAGCTCGGCCGCCTCCACCGCACCGACGTTCCCGGGGGACGC 

TCACTCGCAGCCCAGCTTCGTTCGCGAGCAGGACATCGCCATCATCGGGATGGCCTTCCGGGG 

ACCGGGCGCCGACGACCTGGACGCGTTCTGGAACAACCTGGTCGAAGGGGTCGAGTCGATCAC 

CTTCTTCAGCGAGGACGAGCTGCTGGCGGCGGGCGTCCCCCGCGAACATCTGGCGTCGACGCG 

CTACGTGCGGGCCAAGGGGGAACTGACTGGGATGATGGATTTCGAACCGGAATTTTTCGGTTA 

TTCGGCGCGCGAGGCGGCGGTCATGGACCCGCAGTTCCGCGTGTTCCACGAATGCTCCTGGCA 

CGCACTGGAGCACGGCGGCTACGATCCGACCCGATGCGCGGCATCGATTGGCGTCTACGCCGG 

CGTGACCAACCACCTGCCTTGGCTGATGCGAACTTTGCCGCACCTGACCGAGGAGGAGCAATT 

CGGCGCGCTGCTCCTCACCGACCGCGAGTTTTTCGCACCGCTGCTCTCCTACAAGGTCGGCCT 

GCGCGGACCCGCTATTTCGCTGCAAACCGCCTGTTCGACGTCGTTGGTGGCGATCGGCACGGC 

CTGrCGCGAATTGCGCGCGGGTGCCTGTCAGATGGCCCTAGCGGGCGGCGTGACGGCCAGCAT 

CGA3CGCTGCGGCTACTTCCACCAAGAAGGCTACATCCTCTCGCCTGACGGCCACACGCGCAG 

CTTCGACGCGGCGGCCGCCGGCACGGTCTTCGGCGACGGAGTCGGCATGGTGCTGCTGAAGCC 

GCTGGCCCAAGCCTTGGCCGACGGCGACACGATCCACGCGGTGATCAAGGGAATCGGCATCAA 

CAACGACGGCGCGCGCAAGGTCGGCTTCACCGCACCTAGCCGGGCCGGTCAGACCGAGGCGAT 

TCGGGCCGCGCTGCGCGACGCCGGGGTGGCGTCGAACCGCGTCAGCTACGTGGAGGCGCATGG 

AACCGCGACCAGAATGGGCGACCCGATCGAGGTCGAGGCCTTGACCCAAGCCTTTCGCGCCGA 

AGCCGACGGTCCGCTTCCGCCCGGCTCCTGCCTACTCGGCTCGGTGAAGTCCAACGTGGGCCA 

CCTGAACGCCGCGGCCGGCGTGGCTGGTCTGGTAAAAACCGTGCTGGCGCTCCAACACCGCCG 

CCTGCCGACCAGCCTGTTCTACCAGTCGCCCAATCCACACATCGACTTTGCGGCGAGTCCGTT 

CCGCGTGAACGGCCAGACTTCGGATTGGGTCGCGCCAGAGGGGACGCGGTTGCTGGCGGGAGT 
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GAGTTCGTTCGGTATCGGGGGAACCAACGCCCACCTGATCGTCGAGGAGGCGCCGAAAGCGCT 

ACCGACGACAGCGGCACCTCTGTCGACGGAGCCGAATGACCTCGACGCGGGCGACGCCGACGG 

GCTAGTGCTGCCGATCTCGGCCCGCACGCCGACCGCCCTGGCGCACATCGCGACCAACCTCGC 

CAATCACCTGGAACGACATCCGACCATCGCCCTGGCCGACGTCGCCCTGACCCTTCAGCTGGG 

CCGTCGCCAATGGCCCCATCGCCACAGCCTGATCTGCCGGAATCGAACGGAGGCGATCAAGCT 

GCTGCGCGCCGTCGTCCACTCCGCGGAGGTGCCGCCAGCTCAGGCGCCGGTCTCGGATGCGCC 

GCGCTGTGTTTTTCTTTTTCCCGGCCAGGGCGCCCAATACCCGAGCATGGCCCGCGACCTGGT 

TCGAAACTGTCCCGACTTCGCCCTGCACCTGGACCCCTGCCTCGACCAGTTGGCCGAACTGCT 

TCCCGAAGATCCGCGTTGCATCCTGTTCGGCGATGGCCCCGCCGATCGGCTCGACCAGACGGC 

CTACACTCAGCCGCTGCTCTTCTCCGTGTCCTACGCCTTGGCGCGCTGGTTGGGCGATTTCGG 

CATTCGCCCCGATGCGATGATCGGCCACAGCCTGGGCGAATACGTGGCGGCCTGCTTGGCCGG 

GCTTTTCTCGCTGAGCGATGCCCTGCTGCTGGTGAGTGAACGCGGCCGCCTGATGGGCTCGGC 

CGCGCGCGGAGCGATGCTGGCCGTCCCCTTGCCCGAATGGGAACTGGAGGAACGCCTGGAGCT 

TCTGGCCGACGACCGAATCAGCATCGCGGCGGTCAACACCGCCGAGAGCTGCGTCATCGCGGG 

ACCCAGCGAGGCGATCGAGCGCTGCGCCCAGCGCTGGGCCGCGCAAGGCCTGACCTGTACGCC 

GCXGCGCACGTCCCACGCCTTCCACTCCGCGATGATGGAGCCGATTGTCGAACCCTTCGGCCA 

TGTCTTGGCACGGGTCACCTTCGCGCCGCCGCGCGCGCGCTGGATCTCGAACCTCGACGGCAA 

GCCGATCGATTCCGCGGCGGTGATGCAGCCCGACTATTGGGTGCGCCACCTGCGCCAACCGGT 

CCGCTTTCACGAGGGACTCAGTCACCTGTTGGCCGAGGACACCCATGCTTGGGTCGAAGTGGG 

TCCCGGCCGAACCCTGTCCTCCTTCGTCCGCCGCCACCCGGCCTACCGTCACCAGCCAATCGT 

CAACCCCATGCGCCATGCAGTCGAGTCGACGGGCGACGTGCGCCGGTGGCGCCAAGCGCTGGG 

CGAACTATGGCGGGCCGGCATGCCGGTCGCCTGGGAGCGGCAGCGGCGCGGCCGGCATGCCGG 

ACGACGTGTGCCGCTGCCGGGCTACCCCTTCGAGCGGCGGCCCTTCGCGGCCCGAAGACCGGT 

GGAGCTGGCGCAGCCCGCGCCCAAGGCGGAGCTGGTGAAAAACCCCGAT'CCCGCGCGGTGGCT 

GTACCGCCGCGTCTGGCGCCCTGCCCAGGCTGCGGCCGGCGGACTGGCGGTGCAGGCGACCGT 

TCTGGTCTTCGGCGACGGGTCCGAGCTGTGCCGCGCGGCGGTCGCTCAGGTGCAGCGCCAGGG 

GCTGAAGTGCGTCTCGATCACCGCGGGCCGCCAATTCGCGCGGGAGAGCGACATGCGCTTCAC 

GCTTGACCCCGCTGATCCGCGCCAGCTCGACCAGCTCTTCGCGGCCCTCGATGGCTCAGGCTC 

GCGGCCGCGGTACGTCCTGCACCTGCTGACCCTGAACCCGCCCCCGGATGCCTCGGCGATCAT 

CGCTCACAGCTACTACAGCCCGATGGCCTTGGCTCATGCCTTGGGCGCCCACGAGATCGCGCC 
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TGTCTCGATCACCGTCGTCACCGCCGGGGTCGTCGCCGTCGCGGACGAAGCGATTCGCGAGCC 

GCTGCAGGCGCTGATCGTGGGCCCGTGCCTGGTCATCCCGCAGGAGTTTCCCGGGCTCAGCGT 

TCGGCTGCTGGACGTCAACGTCGACGATCCGGCACCGCGTCTGGCGGAGCGGCTCGTGGCCGA 

GCTCTCGGGCACGGATCACATGGTGGCGCTGCGCGGCGGCGAGCGCCTAGTGGCCGATGTCGA 

TCAAGTCGATGGCCTCGGTGTGGGGATCGCCAAGGTGCCCTTGCGCCGCGAGGGCCACTACCT 

GATTCTCGGCGGCCTGGGCGATATCGGCTACCACTGTGCCCGCTATCTGGCCCAAACCTACCG 

CGCCAAGCTGACGCTGACCGCGCGTTCGTCACTCCCGCCGCGCGCGTCGTGGGAGCGAATGCT 

GCGCGAGGGAAACCTGGATTCCCGGCAGCGCACGCGCATCGAGCGCGTGTTGTCGCTAGAGGC 

GTGCGGGGCCGAAGTCCAGACGGCTGCGGTCGACTTGGGCGATCGCCATCGCTTGGCCGATGT 

GTTCCGCGAAGCACGGGGCCGATTCGGCGCCATCGCGGGCGTGATTCACTCGGCGGGGATTCC - 

GGGACACGTCCACTCGATCGACGAGCTGGTGCGCGTCCGCGACGAAGCCCAATTCACCGCGAA 

GGTTCGAGGGCTGCACCACCTGGCCGAGGTCGTCGATCCGCTGAACCTCGACTTTTGTCTGCT 

GTTCTCCTCGCTCTCGACCGTCCTCGGCGGGCTCGGCTACGGCGCCTATGCAGCGGCCAACGC 

CTACATGGACAGCTTCGCCCGCCGCCACGATCGGCCGGACGAATGTCGTTGGATCGCGGTCAA 

CTGGGACGCCTGGCTGTTCGAAGCCAAGACGTCGTCGGTCGGCGCCGAATTGGCGCGCCTGGC 

GATCGTGCCCGAGGACGCTCCGGCCCTGTTCGCGCGGGTGCTAGAGCGACTTCCGCAATCGTT 

CATCGTGTCCACCGCCGACCTTCGC-GCCCGCATCGACACTTGGATCCGGGACAAGAACCGCGT 

CCCGCCCGCCGAGATCCGAGCGGTTCAACCGCGACCGGACCTGAGCCAGGCGTACGCCCCGCC 

GATCGGCCCGCTGGAGATTCAACTCTGCGGGCTGGTCTCCGCCTATTGCCGGTTCGACCGGAT 

CGGGCGGGACGATTCCTTCTTCGAAATCGGCCTCAGCTCGTTCGACTTGATCCAGCTCAGCTC 

GCGCATTCACCGCATCACCGGCAAGGATCTCAATACGACCCAACTGTTCAGCTACCCCACCGT 

GCGCGCCTTGGCGCTCTTCCTCGGCGGCGAACCGGAGGGGCTCGCGGCGGAGGAGCCCGCCAT 

GGAGAACCTGTGGCTGCAACGAAGCGATGCGACCCTCGATGAGTGAGACCGAGGTCGCCGACT 

GCGGCGCTACCGACCGCGGTCGAGGATTTTCCGCGCAGCGATCCGGGACGACTCGCTGAAGAA 

GCGCGATAGAAGAACGGAATCGTGTATGAAATACGAAACCACCGGATTGGAATTGGCCGTCAT 

CGGTCTCGCTTGCCGCTTTCCAGGCTCACCCGATCCCGAACAGTTCTGGTCGAATCTGCGCGC 

AGGTCGCTCCGGAATCCGCCATTTCAGCGATGCCGAGCTGAGCCACATCCCCGCATCCGTGCG 

TCACCATCCGCATTACGTCAAGGCCAAAGGCGCGCTGGACCACGCCGATTTCGAACCAGCCTT 

CTTCGGCTACTCGCCCAAAGAGGCCGAGGTGATGGACCCTCAATTCCGGCTGCTCCATGAGTG 

CTGCTGGGAGGCGCTGGAGTCAGGCGGCTATGCGCCGAGCCAATTCGCGGGTCGGATCGGCTT 
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GTTCGCGGCGGCGGCCTTCAACGACGGATGGATCGCCGGTACCCTCGACCGGCTGCGCACCGG 

CGTGGGTTTGAGCTCCCTGGAAACCGCGTTCTTGACCCTGCGCGATTACCTGACCACCCAGAT 

CTCCTATCGGCTCGATCTGCGGGGCCCCAGCCTGCTTGTCCAAACCGCCTGCTCGTCGTCGCT 

GGTGGCGGTCCAGCTCGCCCAGCAGGCGCTGATCTCCGGCGAATGCGCCCTGGCCTTGGCTGG 

CGGCGTGTGCGCGACCGATCCGCTGCATTCGGGATACCTCTATGAACCCGGCAACATCTACGC 

GCGCGACGGCGTCTGCCGACCGTTCGACGAGGCAGGCGCCGGTACGGTCTTCGGCGACGGGTG 

CGGCATGGTCCTGCTCAAGCGGCTGAGCGACGCCCAGCGCGACGGCGATACGATCTGGGCGGT 

CATTCGCGGGGCGGGCGTGAACAACGACGGGCACCACAAGGTTGGCTACACGGCTCCTGGCAC 

GAGGGGCCAGGTGGCTTTGCTTAAAAGTGTTTATCGCGCGAGCCGGGTCGACCCGGCGACGCT 

CGGCTACCTGGAGGCCCATGGCACCGGCACCGCGCTCGGCGATCCAATCGAGGTCGAGGCGCT 

TACCCAGGCCTTCGCCAGCAAACGTCGCGGCACCTGCGGCTTGGGCTCGGTCAAGGGCAACCT 

GGGTCACCTCAACACGGCGGCCGGCATCGCTGGACTGATCAAGGTGGTGCTGGCGCTGAAACA 

TCGCGAAGTGCCACCCACCCTCAATCTGCGCCGTCCCAATCCGAAAATCCGCTTGGACGAGAC 

GCCGTTTTTCCCAGTCGTCGAGTTGCAACCCTGGCCAAGCGGGACCGGCCCCTTGCGAGCCGG 

CGTGAGCTCCTTCGGCATCGGCGGTACGAACGCCCACGTCATCCTCGAGGAGGCACCGCCGAC 

GGCCAACCCGGCGCCACACGGCAGATTCCGACTGTTGCCGCTTTCGGCCAAGACACCGGCTGC 

GCTCGAAGCGAAGCGCCGCGATCTGGCCGGCTTCCTCGAACGCCACCCGGAGACCTCCTTGGC 

CGACCTCGCCTTTACCCTGCAACGCGGCCGCGAGGTCTTCAGTCACCGCGCCTGCCTCGCCGT 

GGAGACCTTAACGTCCGCGCGCACGCGGCTGAGCGGCGAGTCGTCGAGCACTTGCGTGGTGGG 

CCCCGCGCCCAGCGCCATATTTCTGTTCCCTGGTCAAGGCAGCCAGCTCGCCGGGATGGGCCG 

CGGTCTGTATCACCATTTCGAGCCGTTCCGCACGGCCGTCGATGCCTGTCTGCGCGAGCTGGA 

GCCAGGACTGCGGCAAGCGCTCAGCGCCCATTTCGATCCGAATCGCGGCGCGGACCCACCCGA 

TTCGACGACCTTCGTCCAACCCTTGTTGTTCCTCGTCGAGTACGGGGTGACCGAGTGGCTACG 

CTGCTTGGGTGTGCGGCCAACAATGGTGTTGGGTCACAGCTCTGGCGAGTATGCCGCAGCCTG 

CGTCGCGGGCGTTCTGTCGCCGTCCGCGGCGGTCTCGCTGCTGGCCGAGCGCGAGCGGCTGCT 

GCGCGACCTGCCAGCCGGCGCCATGCTCGGCGTCCCGCTGGCCGCCGAGGCGCTCGAGGCGAT 

GTTGCCCGACGCTCTCGATCTGGCGGCGATCAACGGCTGTCAGCTTTGCGCCGTGTCCGGGCC 

GGTCGCGGCGGTCCACGCCTTCAAGGCCCAACTGGAAGCCGCCGGACATCACGCCCGCCTGTT 

GCACACCGATCGCGCCTTCCACTCGCGGCTGGTAGCACCGGTGCTTGACCGGTTCCAGGCAGC 

CGTTCAACACGTGGAGCTGCGGCGGCCGCAAGTACCTTACCTCTCGACCGTCAGCGGGCGATT 
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GGAGGCGGATGGGCCGGCGAACCCGCACTACTGGGTGCGTCACCTGCGCGACACGGTGCGGTT 

TGGTCCAGCCCTGGAGGCGCTGCCGCCGGTGGATTCCTTCGTGTGCATCGAGGTGGGACCAGG 

CTCGGCCTTGAGCACCATGGCGCGCGAAACGTTGGGTTCCCAGGCGCGACTGATTTCGTTGCT 

GCCGCGGCCGCGAACGGGGCAAATCGAGCCCGGTCCGGTATTCGAACGACTGGCGGCGCTTTG 

GCGCAGCGGGTTGACATTGGATTGGTCTAAATTGACGGGCGGCGAAGAGGGTCATCGAATTCC 

CTTGCCAGTCTACCCGTTTCAGCGCAGCCATCTGTCGAGCTCCCTGGCGGCGGGCCACACGCC 

TTCGTCGCGGCCTGCAGTCGAATCAGGCGCCATCCTTGCCGAGCGATCCGCAGGGGAAAACGC 

TGAAACCCGGGATTGCCCGCTGCCAACCGCCACGCTCGAGCCCAAGGCGGTCGCTCCGGCCCC 

ACTCGAGGCTACCGACGCCGCAGGTACTCGCGAGCGACTGGCCGAACTTTGGCGCGAGTTGCT 

AGGGTTGACCTCGATTGGGCCCGACGACCATTTCTTCGACCTGGGCGGCCACTCGCTGACCGC 

CACGCGGCTGCGCGCCCTGATTCACCAGCGGTTCGATGTCGATCTCGGGCTCGACGAAATCTT 

CGCTCATTCGCGTCTCTCCCAGCTGGCCGCCCGTATCGAGGCGGCGGCCAAGAGCCGATTTTC 

CTCCATTCCCAGCGCGCCGGACCAGGACGACTATCCCTTGTCATCCGCCCAGCAGCGGATTCA 

CAGCATCGTCACGAGGGCCGAGGTCGGCACTGCTTATAATTTTCCGATCGTCCTCGAGCTGCA 

GGGCGCTCTGGATCGAGTGCGATTCGAGGCGACGTTCGCGGCATTGTTCCGGCGTCATGAGGG 

GTTCCGCACCCGCTTTGTGATGCGCGATGGCGGGCCGCGCCAGCGCATTGTACCGGACGTGGC 

GTTTCGCCTGCCGCTCACCCAGGTCGAGCCAGAGCAGGTTCCCGGGCGCATCGAGGCCTTCAT 

CCGTCCCTTCGATTTGGAACGCGCGCCGCTGTTCCGCGCGGAGCTGTTGCAGTTGGCCGAGCA 

GCGCCATCTGCTACTTTTCGACATGCACAACTTAATTGCCGACGGTATCTCGCTCAACCTGTT 

CGTCGCCGATTTCGCGGCCCTGTACCATGGTCGTCCGCTGGCGCCGCTGAAACTCCGCTATCG 

CGACTATGCCGTTTGGCAAGAGGCGCGGCTGGCCTCCGATGACCTGCGCAGCCAGCGCGAATG 

GTGGCACCGGCGGCTTTCGCCGCCGGTCGCCACGCTGGCGCTCCCTCCCGATTTCCCGCGTCC 

GGCGGTGCGCCGCTACAAGGGCCGTAATGTGGTGTTCCACCTGGACCGGGAGATCCGCGACCG 

CCTGGTGGCCCTGGCTCGAACCCAGGGGGTCACCATGAACGTGATGATGCTGGCGCTCTGGGC 

TGCGCTGCTGCATCGCGAAACCGGCCAATCGGAGCTGGTGGTCGGATCGCTGCTCGGCGGGCG 

GCCGCACAGCGAGCTGCATCCCGTGATCGGGCTCTTCACCAACTTTTTGCCCTTGCGGTTGGC 

GGTCGAGGGATCGACCCGCTTCGATCGCTTCCTTGCCGCTTGCCACCAGGTGTTTCTCGAAGC 

CTATCAGCGCCAGGACTATCCGTTCCACTTGTTAGTCCAGGAACTCGTGCCGGTCAGGGACCC 

GTCGCGGTCGCCGCTGTTCCAGACCTCGCTCGTCTACCACAACGAAATTGACGGCAAGACCAA 

GCTGGAATTGGAAGGGCTGAAAGTCGAAGTGGTTCCCTTCGAAAAGGGTGTGGCGAGGCTGGA 
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TTTGAAGCTGGATGTGACACCTTTTTCCGACCGACTCGAATGTGTTTTGCAATACGACTTGGA 
TCTGTTCTGCGAGGAGACGATGCGCGGCCTGATCGCGCGGTTCCAGGCGTTGGTGGCGGGGCT 
TGTCGCCGATCCGGCGCAATCGCTCGCCGCCGCGAGCGTTTCCGGGAAGCGGGCGCTGCGCGC 
GGGCGTGGCCACGGCAAGCGAATCGTCGCCGCAGTCACTGCCGCCGCAACCATCGACGGCGTA 
CGCCACTCCCTCACCGCAGTCACCGTCGCCGGTAGTCCTGACGGGACCCGCCGACCTGCCCGC 
GATCTTGGCGGCCTACGTGGGGCAGAACCCCCATCCGTTCGCGATCCATCGGGGTCTCATTTT 
GGAGGCGCCGCTGGGGTTGCGAGCGCTGCGGTCGGCGCTGGACGCAGTGCTCGGAGAACACAC 
CCATTGGCGCAGCGTGCGTGCGGGCGATCGCGCGCGGCGCGTGGATAAGTTGGAATTGACCAG 
CCTGGTGCGGCTCGACGACCTGCGCGGGTTGGTCAATCCTCAGGCGAATGCCTTCACCCTGGC 
TTGGCGCGATCTGGCGATGCCGTTCGGGGAGGGGCGTCCCCTGTGGCGACTCCGCCTGGCGTG 
GTCGGCTCCATCGCGCTGGTTGCTATTGCTGACGGTTCATCCATTGATCGGCGACAACGGCAC 
GGTCGACCTCTTTCTGGCGGCACTCGCCGATCACCTGCGCCGCGCGTCCGCTTTTCCCGTAGC 
ACCGCTCGATGAGGCCGAGCTGGAGGCGGAGCTGAAGTGGGGAGAGGAAGGGGAGGGCCTCGG 
GCTGACCGCGATCGCGCCGGTCCTGGGCCAATTGCGCGAAAGTCGGCTGAGTCCTGTGGCCCA 
GATGTGGCTGGACGAGGTCTGTCGCCGCCACGACCTCACCCCGCTAGAGGTCTTGGCGGCCCG 
GCTCCTCGATTGGACACGAAGCCACGGTCACGGGTCGATCGCTTTGTGGACGCCGCTGCCCGA 
GGACCATCCGCTTCGCGATGAAGGCCGCTGCCTCCAGGTTCGCCTGCTGGAGGGGCCGCCGTC 
GCAGCGAGGAGCGGGCGATCCAAGCTGGCTCGAGCAAATCGCCTTGAGACGGGGTACCCCTGC 
AACGGAGGTCGTTTGCCCTACTCCGACCCAACGGGCAGCCATCGACCTCGCGCTGGCCTGGCT 
GCCGCAGCCGCCTCTTCACGGTTTGGTCGGAACCGTTCAGCCGTGGCCGGAATCTCCATTGGT 
CTGTCCGTTTCCCCTCAATCTCGCGTTCCGGCCAAGCCATCCAATTGCCTACGCGCTCAAGCA 
CGAGGCCACGCTCGCGGTCACGGCACGGGCGCGCGATCTGATGCGTTTCCTCGACGGCTTGGG 
CCCGGAAAGCTGAAGATTAGCATAAGCGCCCGGCCAAGGGCATCCTAGGATGACGCAAGCCTC 
GGCCGCGTCGACGTCCCAGGTCGCGCCGGAGGTCACCCCCGGCCGAAAGGACGACGATGACGA 
TCAAATCCGAGATGTCGGCCGTTGCTCACTCTGCGGAGAGCGGCTTCCGCGCTGGGCCACGCG 
TGGGCGGCGCGATGAAGCGGGGCCGGACGCCGGAGCAGGCCGGCGTGAAGCTGCTCCGCGCCC 
CGGTGAAGCGGAAGXGGCTGCCCCCGGCGCCCGTCCTGCGCCTGAGCGAGCGGCGTATCCCGG 
AGGTGTGGGCAGGCTACCGCGCGAGCGCGGGATGACCCGAGCCCCGCCCGCCGGCGCGACCAT 
GACGCCGCCCCACGGGGCGAGTCGTCCGGCGCGCCGGCGCGCGTCGGGGCTTCCGCCGCCGGG 
CGGGCAGGTGCAGGATGGTCGGGCATGGTGACGCGTCCGACGTCCGACGGCATCGAGGACGAG 
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CTCGCGCCGTTCCCCCCGGTCCTGCGCGGCTGGCTCATCGAGGGCGAGCTCGGCCGCGGCGGG 

ATGGGGCGGGTGTTCCGGGCGCGGCACCCGAAGACGCGGGCGCGGGCGGCGATCAAGGTGCTG 

CTCGGCGACTACGCCCGCCGGCCGGACGTGGTGGCCCGCTTCCGGCAGGAGGCGATCGCCGTC 

AACATCATCAACCACCCGGGAATCGTCCGCGTCTTCGACTCCGGCGAGCTCGAGGACGGCTCG 

CCCTACATCGTGATGGAGTACCTGGACGGCCGGGGGCTGCGCGACTGGGTGCAGGCCGTGCCG 

CCCGCGGAGCGGCCGCGGCAGGTCGTGCGGCTCGGCTACCAGATCGCCTCGGCCATGGCCGCG 

GCGCACGCGTCCAAGGTCGTCCACCGCGATCTGAAGCCGGAGAACATCATGGTGGTCGAGGAC 

GAGCTCGCGCCCGGGGGCAGCCGCGTCAAGATCCTCGATTTCGGCATCGCGAAGGTCCTCTGG 

GGAGGTCTGCCCGAGGTGCTGGAGCTCGAGGGGAGAGGCTCCCTCGCGCCCGCGTCCGCGTCC 

ACGATCCGCACCGAGCTCTGGACGCGGCCGGCGCCGACGGTGGGCGCCACGACCGGCCCAGAG 

AGCCCGCTGGGCGCGAGCGCCACGCCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCAGAGAGC 

GCCCTGGGCGCGAGCGCCACGCCAGAGAGCGAGGCCCACGAGGAAGACGCGCTCCGGAGCCTC 

CCCGTCGTGACCAGCGGCAGGCCCGCGATCCACCCCGCGCCGGTCGAGATCCCGGCCGAGGCG 

GTCTCCTCCGCGGCGTCGCGCGGGTCGCGCGCGTCGATCGAGCCAGGCGCGCCCGCGCCGCAG 

AGCGAGGGCGCGGGACAGCCCACGATGCCGTTCACGCAAGAGGGCGTGTGGGGCCTCGGGACG 

AGGAGCTACATGGCGCCGGAGCAGGAGCGCCACTCCGGGAGCGTGGACGTGAAGGCGGATGTC 

TACTCGCTCGGCGTCATCCTCTATGAGCTGCTCGAGGGGCGGACGCCCGACGCGCCGAGCGCC 

GCGTGGCCGCCCCCGATGAGCGCCGCCACGCCGCCCGATCTCGTCGCCCTCGTCCACCGGGTT 

CTGGCGTTCGATCCCGATGCGCGGCCGCGCATGGCGGAGGTGGCGAGCGCGCTTCACCGGCTC 

GGCCGGGCGAAGAAGGAGCTCGACGAGGCGCTCTCGAGGTGGGTCGTCGGCGGAGGGGCGCCG 

GGGCTCTTGCCGTGCGGCTATGCTCTTCTCGAACTGGTCCTCCTGGGCCCTGGGAACTTATAC 

GATTCTTTCCAGCCTGTAAGTGCATTT7TCTTTCAATATCGTCCTCTCTTCATATACGAGGTG 

AGTTCTCTGAGGTCCTCCTATAAGTCTGGGGTGTCCTATTCGGCCTCTTACTTGTTACTTCGC 

CTTCTTAGGAGTTTTTCCTTAATTTTGCCCTCTTACATTCCCGTATTCATTCTAACTGGGCCC 

TATCTCATTCGCTAATACGTTTCTGTATTGTGTACATCTCCTATCATGTGTCAATACTTGTTT 

CTGTTTATCATTATTCTTATTGTTTACGCTCTTATTTCATTCATAGTATAACATTAGTTTACT 

GATTATCGCACTTGAATTCGCG 



or its complementary strand, 



5 
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(b) DNA- sequences which hybridise under stringent condit 
to regions of DNA- sequences according to (a) encoding protei 
cr to fragments of said DMA- sequences, 



ions 
ns 



(c) DNA-sequences which hybridise to the DNA-sequences accordr 
ing to (a) and {b} because of a degeneration of the genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctibnal expression products. 

8. DNA sequence according to claim 6 selected from the f oi- 
ls lowing 



(a) open reading frames: 









Nucleotide Pos-'tion 
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No 
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No 
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No 
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25 


ORF6 
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Seq 


ID 


No 


8 
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Seq 


ID 


No 


9 
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ID 
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ID 


No 


11 
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ID 


No 


12 
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ORF12 


31982 - 


- 32932 


3eq 


ID 


No 


14 


ORF13 


33128 


- 33613 


Seq 


ID 


No 


15 


ORF14 


33661 


* 34007 


Seq 


ID 


No 


16 


ORF15 


35611 - 


- 35255 


Seq 


ID 


No 


17 


ORF16 


37856 - 


■ 35730 


Seq 


ID 


No 


18 



or DNA sequences complementary to said open reading frames, 

(b) DNA- sequences which hybridise under stringent conditions 
to regions of DNA sequences according to (a) encoding proteins 
or to fragments of said DNA sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isofunctional expression products, 

and peptide sequences corresponding to said open reading frames 
SEQ ID No 19 (>0RF1) 

VDPEREAVTLGLAFNRAQGRTYARGPEARAEYIGTAMRAADVI EDRFE I ERLAVSGGMGDVYR 
ARDRVSGQAVALKVLQGASANDI^RFAREAEALVTLRLPGVVQYVAHGVTGAGRPYLAMEWLD 
GVTLEERI^GAPLTIAESVAI^ARVATTLGAIHWLG 

I ARHLRLAPTLTS PGAVLGTPGY I APEQVRGDAPVDARDVFALGCVLFQCLAGRPPFLGNS AL 
ALLMRWLEEPPRLGELRDGI PEPLERLVARMLAKNAGERPRDGAAAAAELAAVAGEGLS IGA 
SAVAAPAAPGEAI TTAERKVMCVI LAEDGGAEAGATLSEDDGAARAEALRD IAARHGGRLDRL 
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QARWWLVALSGAESPTDI^TRAAHCALALRAALC<3VPVSVATGI^VEARLPVGELVDRVAQL 
IAGRDGLSPPEIRLDDATASLLASRFETVQGPGGCWLRGPKEEPDAVPRLLGKPTPCVGRERE 
LSQLATEWRHCVDEPSANAWWGAPGLGKSRLAWEFLRTLEQREGAAI 

SEQ ID No 2 0 (>ORF2) 

VRPCARLNASPSVTASRSGSTAAGSVHASTSACVEQPATGRTQPASPRWPPGAAALRLTSAMP 
RWFNTAGPCNPADHYMLPAEERLPAVRDLVDRKAYFVLHAPRQIGKTTSLRTLAQDLTAEGRY 
VAVLVSAEVGAPFSDDPGAAELAMIJ^WRGTAGAQLPADLRPPPFPDAPAGQRIGAALRAWAQ 
AAPRPLWFLDEADALRDATLVSLLRQIRSGYPDRPRDFPHALALVGLRDVRDYKVASVDSGR 
LGTSSPFNIKVESLTLRNFTRDEVATLYAQHTAETGQVFRPDAVDRAFELTQGQPWLANAliAR 
QLVEVLVKDRAQPITSANVDRAK^ILIERQDTHLDSLVDRLREPRIRAVIEPMLAGTALPSVP 
PDDLRFAI DLGL VRMTAEGGLDVANP I YR3 1 1 VRELAF P IRASLPQ I KATWLTQDGRLDADRL 

LDAFLSFWRQHGEPLLGAAPYHEIAPHLVVMAFI^RVVNGGGTVEREYAIGRGRMDLCVRYAG 

ETIAIELKVWRDGRPDPVAEGLAQLDEYLAGLGLDRGWLILFDQRSGQPPIAERTRRERALSP 
AGREVAVIRA 

SEQ ID No 21 (>ORF3) 

VTIKKTFRSIDPATLPKHFDSPVAELRLADLWEADGTYRYDPSRPREETFWDTPPPTASGSL 
HIGHVFSYTHTDVWRQRRMRGFNIFYPMGWDDNGLPTERRVQNYFHVRTDVRTPYERGLTLP 
QAAPET I KKEPPR I VS RPNF I ELCHKVTREDEQVF KALFRRVGLS VDWRNE YAT I DDHCRRTA 

QLSFLDLHEKGHLYSVFAPTMWDVDFQTAVAQAEVEDRPQSGAFHDIAFAVEGTAEELVIATT 
RPELLAACVGVTAHPEDPRYQHLFGKTALTPIFRAPVPIFPSPLVDREKGTGILMVCTFGDAT 
DVIWWREQKLPLRQMLGKNGRVLPVTFGEGAWESRDPAAANAAYAPLQGRGVKQARAAWELL 
RREEHAAAPGRGPALRGEPRPIERAVKPYERGDQPLEFVPTRQWFVRLADKKAELLEYGDKIK 
WHPDFMRLRYRNWTEGLQGDWCISRQRYFGVQFPVWYPLDAEGNPDHSRPLLATREMLPVDPT 
VDVPPGYEASQRDQPGGFTAESDVFDTWFTSSLTPQISSHWGDDPARHARLFPADLRPQAHDI 
IRTWAFYTIAKAMLHESSVPWHHVAISGWIIjDPDRKKMSKSKGKWTPMHLLDTYSSDAVRYW 
SASARLGTDTAFDEKVLKIGKRLVTKIWNASKYVLSQSAEVHPISEELDRALLHKLSAVVDDA 
TRSFDEHEFAAALERTEDFFWRWFTDAYLELAKARARGEGGAGEAARGSAVAALRLGLSVLLR 
LFAPVLPYITDEVWRWVYAEETGDTSIHRAKWPSAADFAAVAAPSDPGLLDLAAAAMAAVNKR 
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KSELGASVGRVVTDLALGANAATIJ^LKPALGDVLTAVRAGAHALVRPEIADGEVLVVRCELE 
PAAAAAAGAGGAAASEE 

SEQ ID No 22 (>ORF4) 

MIHAEPFEARLVAARPLSPFVRELSFERADGRSFLFEAGQWVNLVLPLPGGEVKRAYSIASAP 
DGSPRFDLAVTLVQGGAGSEHLHRLEPGATLRAIGPHGLFTRDPGDSAPSLFVATGTGITPLR 
SMLRASLRAGLAAPHLWILFGARFEEDVIYRDELEALARGSDRIRYEITLSRGGPSWAGRRGY 
VQAHVPELYRELAEKSGDPAPHVFICGLDRMVSSVRELARGELGVHRKHVHVERYD 

SEQ ID No 23 (>ORF5) 

MKSLPSDRAARLAQSDIRTMTLACAKVHGINMSQGVCDTPVPSVI LQAVKEAMDRGCNTYSRF 
DG I VELRHAI AAKLARHNG I AADPETD I TVS AGATGAFQATCMALLNPGDE VLLFE PFY AYHA 
QAILAVEAVPRYVTARSLSWNVDGDELERAITPKTKAIWNSPGNPSGKVFGRMELEQIADLA 
CHHDLMVITDEI YEYFIFDGREHVSVASLPRMSERTITIGGYSKTFSITGWRIGYSVADARWA 
KAIGAMSDLLYVCAPTPLQHGVAAGIRGLPRSFYTGLAQGYERKRDRFCRALEKAGLPPCVPQ 
GTYYVLADVSRLPGRTGRERAIYLLDETGVAGVPGDAFFEGTQGSRFMRFCFAKTDEDLEEAC 
QRIEQLA 

SEQ ID No 24 (>ORF6) 

VSDPRKERLGDMDLEEFRRIGMRIIDWAADYLGHPDRYPVFPAIRPGDVKGRLAPTPPVEPEP 
MDAVLTDFEQI I LPGITHWNHPRFFAYFANTASGPGI LGELLAACLNVNVMLWRTS PAATELE 
ELVLSWLRQMLDLDAGLHGAIMDTA3TASMVAIAAARDSAEPTIRLRGMAGQRRMRLYASEQA 
HSSIEKAAITLGIGQEGVRKIPTD?AFRMVPEALRAAWEDLGAGLRPFCVAATVGTTSTTSV 
DPIPAIVSVCREHGLWLHVDAAYAGMAAIVPEHRDVLAGCEGADSLVVWPHKWLFTPMDCSVL 
YVRDADRLKRAFSLVPEYLRTEGDVTNYMDWGIQLGRRFRALKLWMIVRYFGHEGLAARIREH 
LRLGQQLAQWVDADPDWERLAPTPrSTVCFRMRPSALACIMRSADEAERESIERELDRLNEAL 
LDEVNKSGRVFLSHTRLHGRYTIRVAIGNIRSDEVAVREAWECLRAAGARLCADERFVSCSRS 
ADEGRGKS 
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SEQ ID No 25 (>ORF7) 

MRREEPVLEAFYERYCAAPRETSYHVELPVDVELHQEAAPALPQARSLEIAGRVALVTGSSRG 

IGKAIALRLAEQGADVAV^^yHSNKDAAEQTAAEIRALGRRT^WVQADVTRPNAAAELFSSV^ 

QLGPIDILVNNVGDFFFK?IJVAMTDDEVJRNV>©SNLSSVHYLCRAAVARM 

S PT YAI RGAPNVAA YS I AKTGVL I LTRS LATEEAPHG I LVNCVS PGL I DNGYLPPAQKE WMER 

RVPMGRLGRASEVADAVAFLASDRASYVSGANIAVAGGWDWTDRGTEHDRRVDLFIGHEEP 
SEQ ID No 26 (>ORF3) 

MSGRFPGARNVEELWQKLRAGVECVVTFTEAEALAAGVSREMLANPSYVRRGAPLDGVELFDA 

SFFGFSPRSAESMDPQQRIFLEVAWEALERAGYDPDAHSGPIGVFAGSAPSGYHSLAQSDPEI 

LGALGHYQLTLNNDKDYLTTHASYKLNLRGPSVCVQTSCSTSLVAVVMACQSLI^ 

GGVG I HAHQRRG YL YQENG I S S PDGHCRAFDVAAKGT VGGSG I G I WL KRLAD ALADGDHVHA 

VIRGAAINNDGSSKIGYTAPSVQGQAEVIGMAQALAGVEPDDISYIEAHGTGTPLGDPIEIAA 
IiTRVFRAKTARRQFCAIGSLKTNLGHLDAAAGVASLIKTVMALEHRELPPSLHPERPNPKLEL 
ESSPFYVNTRLTPWHAARGPRRAGVSSFGIGGTNAHVVLEEAPAPPPSGPSRRWQLLTLAARS 
EAGLARATADMIEHLDRHSGTSIADVTYTSHVGRRAWPFRRAWGESAADLRAALASEGSPRS 
I S SCQAARERPWFLFPGQGAQHLFMARELYEVEPI FRQSLDRCAELLRGPLGLDLRQVLYPA 
EGQRDDAEQELGRTAIAQPALFAIELSLAKLWMAWGIVPQAMIGHSVGEFAAACLAGI FREED 
ALRLVAERGRLMQQMPPGAMLAVPLAEPELAPYLSDDISLAAINGPALSWAGPIEAIDAIJ^ 
ELLDHGLSCRRLHTRHAFHSKMMAPWDAFTRCVSAVERRPPSGHFLSTLTGGWI S PEAATI P 
AYWARQLVEPVRFAQAVRQIiLSESTWLWLELGPGQTLSPLVRQQARADGGQWVASLPRAKDA 
GADHLAVIEALGRVWSAGGTVDWKRFHEGEARRRVLLPTYPFERQRYWASPRHTSAPPEAIIK 
PLLAKNPNVADWFFLPAWRRSDPPVSFDAQAVTTRRSTWLVFIGDEGLGAALVEGLARRGHEV 
VA WTGERFEQTGTQR YT I D PAANGDVAS LFARLE I EGRMPDRI VHAFCTS PADGAR I ERGAA 

LEIERRLGFDSLLLLAQVIAAQRHPKPLMLGVITTRAHSVIGTEIIEPLRALVLGPCRVIPQE 
IPHVSCRNIDIDLPGEGGRAEIAARLIADLERESPDSWAYRGGRRWVESIELTDVGRRSAGA 
APRLRQRGAYLI TGGLGG I GLVAAELLAREAHARL I LVGRTGL P ARQGWDDWLAAHGAGDATS 
RKILRIRALEEAGAEVKIAAADVSDFNAMRSVIEEARTRFGRIDGVIHSAGIASGGMIQLRTP 
MAAVIRVMAPKVGGTLVLDALLRDERPDFLLICSSLASLVGGATQIDYCAANAFLDAYAQSREG 
EEGCRVISVQWDTWSDVGMAVDFKLPADLQEGRRESLKRGISSSEGAEVLGRILSAGMSGPliA 
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ICTSDLPAYKQSVTTRRSQHEQTPAARPMHSRPTTTGAYVAPETETERRIAAIWQDLLGLEQV 

GANDDFLQLGGHSLLATQVLSRVLQTLKVGISLPQFFDAPTVAGIjSRLVDAARAEGAGPVAPA 
I GRVERDAYR I KP PAAEQAARTKP 

SEQ ID NO 27 (>ORF9) 

MEPVGGVDMNQPAKQQETCVFPTSFAQRRLWFLDQLEPGSAVYNMPASFRTRGPYDVDSLVRS 
VNEIVRRHESLRTTVDVIDGEPVQVIAPSLRIEVPVVDLSEIDEPEREAEARRU1AEESRRPF 
DLTRGPLLRAKLLRLGEADHVLILTMHHIVSDGWSMDVLFKELSTLYAAFHEGRPSPLPELPI 
QYADFAWQRELLQGEVLESHLGYWREHLRGAPTLLELPMDRPRPPAQTFRGSQRAFRLPLSL 
QQAVQALSRQEGATPFOTLLTAFSVLLSRYARQSDLWGTPIANRTRAELEGLIGFFVNMLAL 
RIDLGGDPSFRELLGRVREVTLGAYAHQDLPFERLVEELSPGRSPSHSPLFQVSFTLQNTPMD 
ATNRAD I ASGGA PLVEMKAAKFDL ILELSES PQGLLGTFE YNTDLFDAGT I ERMAGHLE VLLS 
SAVAAPDRPIAELPLMGAEERSRVLVEWNSTAALYPEDHCMHELFEQQVERSPEATAVLLQQQ 
TLTYRELNMRANQLAHHLRSLGVGPEVRVGLYLERS I ETWAI LGVLKAGGAYVPLDPTYPSE 
RLGLMMADAAPSVLLTQASLLSKLPPHGDATLVQLDAIjHEALSRLPHHTPRSGVTAQNLAYVM 
YTSGSTGRPKGVLVEHRGLCNLPTVQAKLYGIAPGDRLLQFAPLCFDTSFCEIALALLSGATL 
VMGTADELLPGPPLVELLKKHAVTAMLU^TVL^ 

WKAPGRRLFNS YGPTETT I WAS SAADLSDERI PP I GRP I ANTQ I YVLDEALEPVP I GVPGE I F 
IGGVGVARGYHGRPDLTAERFVPDPFGQTKGARLYRTGDRARWLPDGNLEFLGRNDEQVKVRG 
VRIELEEIRAALLKHPAVAQAVAWREDTPGDKRLVAYWGRGGARVTAAELRQSVSERLPAT 
MVPSSFVALDALPLTPNGKVDRRALPEPEQSAGGEDHVAPRNAVEEELARIWASVLRLERVGV 
HDNFFEIGGDSILSIQIWRAQQAGLRLTPRQMFQHQTIAELSTVARAVEAVHVEQDPVTGPA 
PLTPVQRWWLEQEAAEPHHFNQS I FLEVRERLDESALEQAI AHLIDHHDALRLRLARDERGAH 
QVFAAPGGSTPFQRVDLGALPSAEQISAMEKAASEAQASLDLAAGPVVRAVLFDLGEVAPQRL 
LVIAHHIAVDSVSWRILLDDLFGAYEQARRGEAVRLPPKTTSVKRWAELLTEHAGSEAVKAEL 
GYWLDSSRRTVAPLPVDRRAGEDVWGSARHIWSLTPEQTEQLLREVPQAYRTRIDDALLTAF 
AQAIARMTGSPAVLLDLEGHGREELAGVDLTRTVGWFTAMYPILLRVDAADPGEALKSIKEQIj 
RAVPGRGLGYGLLRYLRSDTIAEVRALPQAELCFNYLGQLDOAIPEAAPFRPAREYQGSERSP 
GAHRAHL r EVNAS I ANGRLYATWTYS ERRHEPETI ERVAAS FVTALRALI AHCTLPEVGGNTP 
SDFDKVRLRQETIDALDAIDAGPGPSARGSRIEDVYPLSPLQEGILFHTLYATDYTAYVEQFH 
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WTLEGDFDAEAFTRALQDWARHAALRTSFAWERLDAPLQIVRTGAVLPVEHQDLRGLAAEEQ 
TAHISRYVEAERQRRFDLRKAPLMRAGLLRLRKDAWCLVETIHHLILDGWSTQILLKEVFTLY 
EAHRGHRGHLALELEQPRPYGDYIGWLAKQDQVRTAAFWRRELEGFSAPTPLGVDRAVPHDDG 
GPRFGWRRIALSGDDAARLAAFARQKQLTMSTLVQGAWALLLSRYSGDPDVLFGMTVSGRSAP 
IPGIERMTGLFINTIPVRVREPADASVLAWLKALQEHEAELLEHEHSPLVEVQAHSDVPRGTP 
LFESLWFENYPVQVIFEAPPVEGPTRAEEGLRMIDAQYISDPPYPLTWAAFHGTLYLNIGY 
ERRRFDDQAVERMIGHVTTLLRGFVQRPETSVRDLPLLTAEEERTQLHAWNATAAPYPEGHCM 
HELFEQQVERS PEATAVLLQQQTLT YRELNI RANQLAHHLRS LGVGPEVRVGLCLERS I ETW 
AILGVLKAGGVYVPLDPTYPSERLGLMMEDAAPSVLLTQTSIjLSKLPPHGDATLVQLDALHEA 
LSRLPHHTPRSGVTAQNLAYVMYTSGSTGRPKGVLVEHRGLCNLPTVQAKLYAIAPSDRLLQF 
APLCFDTSFCEIAIALLSGATLVMGTADELLPGPPLVELLKKHAVTAMLIAPSVI^ALPEQQS 
AALPLRVLAMAGEACPAELVKRWKAPGRRLFNSYGPTETTIWASSAADLSDERIPPIGRPIAN 
TQIYVLDEALEPVPIGVPGEIFIGGVGVARGYHGRPDLTAERFVPDPFGQTKGARLYRTGDRA 
RWLPDGNLEFLGRNDEQVKVRGIRIELEEIRAALLKHPAVAQAVAWREDAPGDKRLVAYWG 
RGGARLTAAELRQSVSERLPATMVPSSFVALDALPLTPNGKVDRRALPEPERSAGGEDHVAPR 
NAIEEELTRIWADVLGAKRVGVHDNFFDLGGHSLLLVRVHDRLGQRFDRPPSMVDLFTYPTVA 
SLARFLGERANGKQSPREAAADVTERGRRRLEARARRAKAIRGPT 

SEQ ID No 28 (>ORF10) 

MKHNIGWLLPAALATLAFVPACSPNHGEDAPSWSAESGAAPSADCVALGAiCLQAALDGAAAA 
QKAPGAAAAVQSGDCVWRGATGVSDLVASTPTKPGDLFRIGSITKTFVSTLILMLRAEGRLSL 
DDAVSKYVKGIPAGDQMTLRQILGHTSGLFDYTYSPALGQMIEVDPTRAFAPAELIALATAEA 
PYFAPGAGFRYSNTNYIVAGLVAEAVSGGTLAGLLRTRILDPVGLAHTYLDGAEPPVQGLIRG 
YGD YGAGL VD I TDQLS PTEAWAAGALVSNVDDLNRF FALL I SHELLS SDELQDMTTWTPTMWP 
HEPGYGLGLI ERDSALGSLNGHCG 1 1 WGFQS AS YGVPGRGDA I TAL I NRSDGDAARLVDELAK 
WKER. 

SEQ ID No 29 (>ORFll) 

MSIDRAVLEQLDRVGGRLAEGKALKLLEDIAWPREVEERFFAAGEDRLPEVEYRVDRDGIARR 
VAELRELLGAIDGDAPALGWLRDNVRAQIQAAELLEAAGTRAFSARSQELYGGARSRFFGGSL 
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RNIDIJ^EHLTERLRVHGWDEASDPEEEPLDAGA^ 

VAGMSRVRIRPEATFAAWEAEGLWHHEVETHALTAHNGAAQPRCAFLRSGGPRTTRTQEGLAI 
FAELYSRSLSIGRLTRLAERVRLVDMAEQGASFLDLYRHLRERGASRRDAYPDAQRVCRGGLV 
EGGAPFTKDACYLAGLLEVYAFLAAVLRGGLRDEVELLVCGRIALDDIAVLAELRAAGVLERP 
RYLPGWLRAWQTLLPYFAFTSFMDGIDLGPVERHFQELLRVAADARPAGEGRRRRGRPREG 

SEQ ID No 30 (>ORF12) 

MSESVAQLEEHRAALTGHCYRMLGSVVDADDAVQETOVRAWRSLDKFDGRSSLRTWLYRIATN 

VCIDLRADRARRARPIEEGPVGTVDDALETRPRTHWLEPVPDAHALPADIDAAERAMLRQSIR 

IAFVAALQHLPPKQRAALLLTEVLGW3AAEVADSLNTSVAAINSALQRARATLASRDLGDARP 

SLPEPQSALLDRYVNAFERYDVDALTALLHQDATLSMPPFTLWLRGHESIRAWLVGPGAGCRG 

SRLIPTAASGSPAFAQYRPAPEGGHRAWALIVLDVAGDRIVSMTSFLDTETLFPRFGLPLDLP 
A 

SEQ ID No 31 <>ORF13) 

VTIASIDHRDQDL^1TGPQAKAPARAAAPDAAPSRRAVWAGRVLSGIATLFLTFDAAVKVLKLF 
PAEASTAELGFPAHLVPTLGYLQIACLVAYLIPRTAVLGAILWTGYLGGAIAIHVRVENPLFS 
HTLFPIYVAAFLWAGLWLRDRRVRALTASPSSQGR 

SEQ ID No 32 (>ORF14) 

MTTKNPRKLFWLSVRDLKRSMEFFSKLGFEF^PQFTDEKAACMWSEEAYVMLLVESFFKTF 
MKKEICSTSTHTEGLFALSCSSRAEVDDMVKKAVAAGGSHAMDPQDHGFMYGWSFYDVDGHHW 
EVMWMDPKAIQP 

SEQ ID No 33 (>ORF15) 

MTPSERLDATFAALADPTRRAIIARLASGEMVTELAKPFAMSQPAISKHLKVLERAGLISRG 
RDAQRRPCRIEAKPLEDASGWLDNYRRFWEGSYERLDDLLEELKERESKGERSKR 
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SEQ ID No 34 (>ORF16) 

VAPASAPAAGGRDAAPFLDEAAQWLRGEQAPASRPAGEGPAGRLPGRVLVADDNADMREYALR 
LLVAEGWTVEAVADGRAAIjERARAHPPDLVLTDVRMPRLDGFGLLRALRADDRTRGVAVVMLS 
ARAGEEARVDSLEAGADDFLVKPFSAKELIARWIHVELARRRREAEGQRQYLNDLFMQAPGP 
IAILRGPEHVFEWNPLYQRLVGGRSLVGEPIRAALPELEGQGIWELLDAWRTGEPIVGKEL 

pvrldrrgdgtteevffnfvyqpmrdrdgavegvfvfafdvtdqvrarrrv^ 
rkdeflamlahelrnpmas I slsltllddadgdgpasaryrei arrqmghlvrlvddlldvsr 
itrgtvelrledvdlaawqsaaaavrpavearrhdvslsvgpgdfgmradatrleqvvtnll 
tnaakytppggs i svrltreaavgapeavlrvrdtgrgi paamlekvfdlftqvdqtidrstg 

glglgltlvrrllelhggsvaaasagpgqgseftvrlplgpgaapqpapsagppppregpppa 

qrdepppppaqraeapeaaadrrrvlwedaedvrrvmrayiealghevtvavdglegvkkll 

elrpevafvdiglpgidgyevarraraapggealylvalsgyggpddqarsrragfdlhltkp 
wgatlqdvltaprt 

9. DNA sequence according to claim 7 selected from the fol- 
lowing 

(a) open reading frames, and peptide sequences corresponding to 
said open reading frames: 

pEPOcos6_ORFl sequences : 

(1) nucleotide sequence 

Seq ID No 3 5 (>pEPOcos6_ORFl . seq) 

GGATCACCTGCGGCGCGATCGCCGACCTCGTGCTGGTGTTCGGCTCGCTGGATGAGAAGCCGG 
CGGCGCTACTGATAGAGACGGCGACGCCCGGGCTGCGGGTGGAGCGGTTGCGGGAGATGCTCG 
GCTTTCGGGCGGCCCACCTGGCGAAGCTGTCCTTCGACGGTTGCGAGGTCCCCGAGGCTCAGC 
TGATTGGCCGGCCCGGCTTTGCGCTGATGTATCTGGCCCCCTACGCCCTGGATTTCGGTCGGG 
TCAGCGTCGCCTGGGCCTGCCTGGGCATGATCCGCGCTTGCCTGGAGACCTGCGCACAGCACA 
TCCTCACCCGCCGCACCTTCGGCCACCTGCTAGCCGATCACGGCATGATCCAAACCCTGATCA 
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CCAACCTGGGGATTCACCACCAGGCGACGCTGCTCCACACGCTGCAGGCCTGCCGCGCCAGGG 
ATCGCGGCGACGTGACCGCCTCCGAGGCCACCCTCGCCGCCAAATACCTCGCGTCGCGGACGG 
CGGTCCAGGAGACGACCAACGCGGTCCAGATCATGGGCGCGCTGGGCTGCGACGAGGAGGGCG 
CGATCGCCCGCCACTTCCGCGACGCCAAGACGACCGAAATCATCGAAGGCAGCAACCAGATCA 
TCGAGGCGCTGCTGGCCAAGAACATCGCCCGCGCCGGTCGCGACAACTATCGCCGCTTCCTCG 
ATGCGGAAGTCGAGCCCGGTCGGGCCGGAGGCGCACCA 

(2) peptide sequence 

Seq ID No 3 6 ( >pEPOcos6_ORFl .pep) 

ITCGAIADLVLVFGSLDEKPAALL-2TATPGLRVERLREMLGFRAAHLAKLSFDGCEVPEAQL 
I GR PGF ALMYLAP YALDFGRVS VAWACLGMI RACLETCAQH I LTRRTFGHLLADHGMI QTL I T 
NLG I HHQATLLHTLQACRARDRGDVTASEATIiAAXYLASRTAVQETTNAVQ I MGALGCDEEGA 
I ARHFRDAKTTE 1 1 HGSNQ 1 1 EALLAKN I ARAGRDNYRRFLDAE VE PGRAGGAP * 

pEPOcos6_ORF2 sequences: 

(1) nucleotide sequence 

Seq ID No 37 ( >pEPOcos6_ORF2 . seq) 

ATGACGAGCGCGGTCCCGACGCGTCAAACCAGCCTGCTCGACGACTTCGAGCGCGTCGCCGAC 
GTCGATCCAGAGCGGATCGCCGTCCACGCGAGCGAGACGAGCCTGCGCTATGGCGACATGAAT 
GCGCGCGCCAACCGCATTGCCCACGGGCTACGGGCGCGCGGGATCGGGCCCAATCAAATCGTG 
GCGGTGGCGATGGCCCGCACGCCCGAGCTGATGATCGTGCTGTACGGCATCCTCAAGGCCGGC 
GCGGCCTACATGCCCATCGCCCGCGACGCGCCGCCGCTGCGCCGCGATCATATGCTGCGCGAG 
AGCCAGGCTGCTCTGATGATCGCCGACGAAGAGATCGCGGGACTCGCGGCCCGGGTGCTGACG 
CCGGCCGACCCGTTCTTCGCGGCCATGCCGGACCACAACCCCGAGCCGCGTCACGACCCGACC 
GACCTGATTTACGTCATCTACACCTCGGGCTCGACCGGCCAGCCCAAGGGCGTGGCCATGGAG 
CACCGCGCCGTGTGGAATCGCCTGACTTGGATGCAGGCCCAGTATCCAATCGACACGCAGGAC 
GTGATCCTCCAAAAGACGCCGATCGTCTTCGACGTGTCGGTCTGGGAGCTGTTCTGGTGGCCG 
CTGGCCGGCGCCTCGGTGGCCCTGCTGCCGCAATCCATGGAGAAGTTCCCCTGGGCGATATCG 
GCGACGGTGGCGCGGTGCGGGGTGACGGTGATGCATTTCGTACCATCGATGCTGATGGCCTTC 
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CTTCAGGTGGTGGCGGGCCGGCCCGAGATGGCGGACCAGATGAAGGGCCTGCGCTACGTCTTC 
TGCAGCGGCGAGGCCCTGGCGCCGGCCCACGTGTCAGCCTTTCAGGAGCACATCAACCGAGCG 
GGCAGCATCAGCTTGACCAACCTCTATGGACCCACCGAGGCGGCGGTCGACGTCAGCTACTTC 
GACTGCCCGCCCGGCGCGTCACTCGCGCGGGTGCCGATCGGACGAGCGATCACCGGCATCCAG 
CTGCTGGTCATGCGCGACGGCGTGCCTCAGCCGCCCGGCGTCGAGGGTGAGCTCGCCATCGGC 
GGCGTTGGTTTGGCGCGCGGCTACATCTCACGGCCAGACCTGACCGCCGACCGGTTCGTGCCG 
CATCCAGGCGGCGACGGCCAGCGGCTC7ACCGCACCGGCGATCTGGTGCGCAGGGACGCGGAC 
GGCGAGCTGGTCTTCCTGGGGCGCATCGACCATCAGGTGAAAATTCGCGGTCTGCGCATCGAG 
CCCGGGGAAATCGAGGCCCAGATCAGCGCCCATCCCGATGTGGCCGACTGCGCGCTGATTATC 
GAGCAGGACTCGGAAACCCTGCCCAAGCTGACCGCCTACATTGTCGTGGCGCGACCGGGCTTG 
ACCCGGAAGGCGCTGCTACAGTTCC7GGGCGCGCGGCTGCCCGACTACATGCTCCCGAACCGC 
TTCCTGACCCTCACGGAGCTGCCCC-TGACCGCCAACGGTAAGCGCGACTGGCGCGCGCTGCTC 
GGCCCGCTCGAGACCCTGCCTCTCCCTTTCTCC 

(2) peptide sequence 

Seq ID No 3 8 (>pEPOcos£_ORF2 . pep) 

MTSAVPTRQTSLLDDFERVADVDPERIAVHASETSLRYGDMNARANRIAHGLRARGIGPNQIV 
AVAMARTPELMIVLYGItKAGAAYV?IARDAPPLRRDHMLRESQAALMIADEEIAGLAARVLT 
PADPFFAAMPDHNPEPRHDPTDLir/rYTSGSTGQPKGVAMEHRAVWNRLTWMQAQYPIDTQD 
VILQKTPIVFDVSWELFWWPLAGA3VALLPQSMEKFPWAISATVARCGVTVMHFVPSMLMAF 
LQWAGRPEMADQMKGLRYVFCSGZALAPAHVSAFQEHINRAGS I SLTNLYGPTEAAVDVSYF 
DCP PGASLARVP IGRAI TG IQLLV>'RDGVPQP PGVEGELAI GGVGLARGY I SRPDLTADRFVP 
HPGGDGQRLYRTGDLVRRDADGELVFLGRIDHQVKIRGLRIEPGEIEAQI SAHPDVADCALI I 
EQDSETLPKLTAYIWARPGLTRKA^LQFLGARLPDYMLPNRFLTLTELPVTANGKRDWRALL 
GPLETLPLPFS 
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pEPOcos6_ORF3 sequences : 

(1) nucleotide sequence 

Seq ID No 39 ( >pEPOcos6_ORF3 . seq) 

ATGTTACACCCGATTCCCACCGACCGTTTCGCCCTGAGCCGACCGCTCTTTCGCGGGTACCTC 
GCGCACGATCCGATCGTGCAGGGCGTGCTGGCGGGCGACCATCCAGGCTGGGTCCTGGTGGAC 
CGCGAGCCCGAGCCGCGCACGGCGCTGCTGTGGGCCTTTTCCGATCGGCTCTTCTGCGTGGGC 
GCAGCTGACACGCTGACCCCGCACGCGCTGGCCGAGCTGTTCCACGACCGACTGATCCCCCAG 
GCCCGTAAGATCGGGCAGCCGTTTTTCCAGGTTCAGGGCGAGACGGTCGACACCTGGTCGGAC 
CACCTGCATCAGGTGTCGCCGCACGCGACAGTCTCCTTCCGCCAGGCATTCCGCTTCGACCGC 
GACCTCTTCGAGCGGCTGCCAACCAAGCCGGAGCTGGCAGAGGCGCGGCTCGTGCCAATCGAC 
GCGCGGCTGCTGGCCGAACAGGCTGATCTGCGCGAGCGGATACTGGCCTCCTGGTCCAGCGAA 
GCTGCCTTCCATGCGCGCGGTTTCGGCTTCTGCTACCGCGTAGGTGACCAGCTGCCGAGCGTG 
TGCCTGGCATCGCACGTAGGCGGCGGCGCGGCCGAGCTGAGCATCAACACCGAGCTCGAAGCG 
CGCAATCGAGGTATGGCAACGCGGCTGTGCCGGCGTTTCATCGCCGAATCGCTGCAGCGCGGC 
CTGACGCCTTGCTGGCXSCACCGAGACCTTTCGCCTGCCGTCAATCGCGCTGGCCCAGAAGCTC 
GGTTTCATCCCGACCTTCACCTTCCCCACCTACTGCTTCGCGACCGGCACCGAACAGCCGGAC 
GACAACTTCCTAGGCGAGCTGTACTACAGGGAATCGCGCATCGCCGGAAGTGGGACCGATGAG 
CCGCAAGCGGTTCGGCTGGCGCGGGGTTGGAGCCTGGCCGGCGACACCGAGCGTGCCGCGAGC 
TTCGCCGCACGCGCCCTGGCCGAAGGGTGGGCCGGCCACTCGACTCTGGCCACCGATCCGGAT 
TTCGCCCGATTGCGCGCCAGCGCCGCCTGGCCCCGCCTCAATGTCCCT 

(2) peptide sequence 

Seq ID No 40 (>pEPOcos5_ORF3 .pep) 

MLHPIPTDRFALSRPLFRGYLAHDPIVQGVLAGDHPGWVLVDREPEPRTALLWAFSDRLFCVG 
AADTLTPHALAELFHDRLI PQARKI GQPFFQVQGETVDTWSDHLHQVS PHATVS FRQAFRFDR 
DLFERLPTKPELAEARLVPIDARLLAEQADLRERIliASWSSEAAFHARGFGFCYRVGDQLPSV 
C1ASHVGGGAAELSINTELEARNR3MATRLCRRFIAESLQRGLTPCWGTETFRLPSIALAQKL 

GFIPTFTFPTYCFATGTEQPDDN7LGELYYRESRIAGSGTDEPQAVRLARGWSLAGDTERAAS 
FAARALAEGWAGHSTLATDPDFAP.liRASAAWPRLNVP 
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pEPOcos6_ORF4 sequences: 

(1) nucleotide sequence 

Seq ID No 41 ( >pEPOcos6_ORF4 . seq) 

ATGATTTGTCACTCCCACCGCTTCATTTTCCTCCACGTTCCCAAGGTCGCCGGCACAAGCGTC 
AAGGACGTCCTCGGCCAAGAGCTATTCCAGGAGGACCAGGTCACGTTCCAGATCGCTCCCAAT 
CCCCACTACCCACCTGAATGGACTGCGCCTTACGAGGAGCACATTATTGCCGCTGAATTGAAG 
AGCCAGTTGGCGCCGGAAATTTGGGACGATTACTTCAAGTTCGCCTTCGTGCGCCATCCGCTC 
GACTGGGCGGTCTCCAATTACTTCTTCTTCCTGCGCGACCGCAAAGGCCATCCGGCCCACGAA 
TTCCTGGAGCGGAAGGGCTTCGCCGGTACCATGGACATGTTTTTCGGAGCGGCCGGGCGCCAT 
CCGCTGGTCGCCGGCATGCGCTTCAGCCAATGGGAGTTCTTGTGCGACAGCGAGGGCCGGACG 
CTGGTGGACTTCGTTGGCAAGTACGAGCGGCTCGAGCAGGACTTCGCCGCCGTGTGTATCCGC 
ATCGGGCTGACCCCGCCCGACTTGCCGTGCCTCAACCAGACTCGCCACCAATCCTTTACCAGT 
TACTACGACGAGGCTTTGATGCGCCAAGTCAGCCGCGCGTTAGCTCGCGATTTCGAAATTTTT 
GATTATGCC 

(2) peptide sequence 

Seq ID No 42 ( >pEPOcos6_ORF4 . pep) 

MICHSHRFIFLHVPKVAGTSVKDVLGQELFQEDQVTFQIAPNPHYPPEWTAPYEEHIIAAELK 

SQLAPEIWDDYFKFAFVRHPLDWAVSNYFFFLRDRKGHPAHEFLERKGFAGTMDMFFGAAGRH 

PLVAGMRFSQWEFLCDSEGRTLVDFVGKYERLEQDFAAVCIRIGLTPPDLPCLNQTRHQSFTS 
YYDEALMRQVSRALARDFE IFDYA 

pEPOcos6_ORF5 sequences: 

(1) nucleotide sequence 

Seq ID No 43 ( >pEPOcos6_ORF5 . seq) 

ATGAAAGTGGACAAGCGGAATGTCGACGACATTCTCGGACTCACTCCGACACAGACAGGCATC 
TTGTACCACTACCTGCTGGACCCGCAGGCCGACGCCTATTTCGAACAATTGACGCTGCACCTG 
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GAGGGGCCGCTCGACGTAGCGCGCTTCCGCCGCGCCTGGGAGCGCGTGGTGGCGGCTCACGAC 
CAGCTGCGCGCCGTGTTTCGCTGGCAAGGGATCGAACACCCGGTGCAGATCATCCTCAAGCAG 
CACGTGCCGGACCTGGAGTTGGCGGAGGTCCCGCGCGACGCCGATCCGGCAGCCTTCCTGGCG 
CAATGGGTCGCGGCCGACCGGGCGCGC7VAGTTCGACTTCGAGACGGTGCCCTTTCGCATCGGC 
CTCTGCCGGACTGATACCCAACATCACGTGATGCTGCTCAGCAATCACCATATCCTGATGGAC 
GGTTGGAGTACGGGCCTGATTCTGCGGGACTTCCTCGCCTGCTACGGCGACTCCGAAAACTGG 
CGGCCACGCACCCGAACGCACTTCAAGGCGTTCATCAAGTGGCACCAGAACCGGCCACGCCGG 
GGCGAGGAGCGATTTTGGCGCGACCTGTTGCGCGATGCGCCCGACGGCGGCTTTCCCCGCCTG 
GGCGTCGAAGAAGGCACCCGCCACTCGCTTGACTTCGGCGCCCGCAGCCGCGCTCTCGACGAC 
CGCTTGACCCAAGGCTTGCGCGACATGGCTCGCGACCTCGACGTCACCCTCGCCGCGATGCTC 
CATACCGCTTGGGGCCTTCTACTCCAGCGCTACCAGAACAGCTGCGAAGTGATATTCGGGACC 
ACCGTTTCCGGCCGCAACGTCGAGCTCGCCGGCCTCGACGAGGTGGTCGGCTTGTTCATCAAC 
ACGATTCCGTTCCGCTTCTCGGCCGCGGCCGCGACGACGCCCGTCGAGGCCTTCCGTGCGGTA 
CAGCGCAATCTGCTGGCGAGAAGCGAGTTCGAAGCCACCCCGCTGGTGGACATCAAGGGCTGG 
AGTGGTCTCGGTCCGGGCGCGGAACTGTTCGACACCATCCTGGTCATCGAGAACTATCCCTTG 
GACCGCGCTATCTTCGAGAGTGATTCCAGCCTGCGGTTGACCGACCACCAAATCTTCGAGCGC 
ACCAATTACGGGCTGACCCTGACCATCGAGACCTTCAGCCGGTTGCACGTGACGCTAGCCCAT 
CGCCGTGACCTGCTGGGCGACGCGGCCGCTGAGCGAATGCTAGATCATTTCACCGGCCTGCTC 
CAAGCCATGCTGCGCTTCCCTCACCAGCCGTTCGCGCGCCTCGAGATGAAAAGCGAACACGAG 
GCCCACCGCGTCCTGCACCAACTCAACCAAACGCGTCAGCCGCTGCCGTCCCAATCGGCTTTC 
CACCAGTTGTTCTTCGAGCAGGCCCAGGCCGATGGGGCACGACCGGCGCTGTGGTGCGGCGCC 
ACGCGCTGGACCTACGGCCAGCTGCTGGAACGTGCCCTGCGTCTGGCGGGACGGCTGCAGGAA 
GCCGGCTTCGCCCGAGGCGATGTCGCCGCCGTCAGCCTCGGCCCGGTTCCGGATCTGATTCCC 
GGTTTGCTGGGCCCGCTGTTCGCCGGCGGCGCCTACCTGCCGCTCGATCCCACCCTGCCGGCC 
CAGCGCTCGCGGTTCATCCTCGACGATGCCGGTTGCCGCTTCCTGATCAGCGACGCGCCACTC 
' GCGGGGCCCACGCCGATCCATCCGGACCCTGCCGGCGCCAGCCCCGTTGACGTCATTTTTGCC 
TGTCAGGACGGCGCCGCGCAGCCCGCCTACCTGATCTACACCTCGGGCTCCACCGGCCAGCCC 
AAAGGCGTCTGGGTTAGCCACCGCAACCTGATCAACTTCCTGACGGGCATGAGCGCAATCCTG 
CCGGTCGCGGCCGACGACGTGTTCCTCTCGCTGACTACCGTGTCGTTCGACATTTTCGGGCTC 
GAGACGTGGTTCCCGCTCAGCCGCGGCTGCACGATCGTCTTGGGCACGCGCGCCGAGCAGTTG 
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GACCCGGCCGCGGCTGCCAAGGCCATCTCCTGCCATGGCGTCACGGTTTACCAGGCGACGCCA 
TCGCGACTCCAACTTCAACTGGAGCACCCCACATTTGTCCGCGCCATCGGCTCCCTGACGACC 
CTGCTGGTAGGCGGCGAACCCCTCCCAGCCGAGCTGCTGCGGCGCGTACGCGAAGTGACCGAT 
GCGCGTATCTTCAACCTCTACGGTCCCACCGAAACCACCATCTGGTCCACAGCCGGGGAGGTC 
ACCGCGGCGGACGTCCCGGATATCGGCCGCCCGATCGCAAATACCGGCGTTTTCCTTCTGGCG 
CGAGACGGCTCGATCCAGCCGCCGGGCCTGGTGGGCGAGTTGTGCATCGCCGGCGAGGGCGTG 
GCGTTGGGCTACCACCGACGGCCGGACCTGAACCGAGAACGGTTTCGCGAGATTCCGCCGGGC 
CGCCTGCCCTTTGCCGGCAAGCTCTACCACACCGGCGACCTGGCCCGCTGGACCGAAGACGGA 
CGGCTCCTCTGCCTGGGCCGTCTGGACGACCAGCTCAAAGTGCGCGGCCATCGCGTCGAGCCG 
GGCGAGATCGAGGCAGTGATGGCGCGCCACCCGGCGGTCACGCAGGCGGTGGTCGTCACGCGG 
CCGCGCAACGGCGAGCCGGTCTTGGTCGGGTTCTGGACTGCGGAAGGTGAGCCGATGCCAGAG 
GAAGCGCTGAGCGCTTACCTGGCCGACCGACTGCCGAGCTACATGGTACCCGAACGGTGCATC 
CTCATGAAGGCCATGCCGCTAACCGGCAACGGCAAGATCGACCGGCGCGCCCTACCCAATCCC 
TTCGCCTTGACCGAGTCGACCCGGCAGGCGGCGCCGCGCACCTTGGCCCGCACCGCCGGCGAG 
CATCGGGTTGCCGAGCTGTGGCAGGCCTTGTTGCGACGCGAGGCGATCGGCTTGGACGAACCC 
TTTTTTCAGGCCGGCGGGAACTCATTCGGCTTGATTCGGCTTCACGCCAAGCTGGAATCCGCC 
TTCGGGAAGTCGTTCCCGATCACCGATTTGTTCCAGCATACCAGTATTCGCAGCCAGGCAGAA 
ATGCTGAGCGGCTCGTCCGTCGAGGCGCCGCTCGCGGGAGCCGTGCCGCAACCCCCGGCCGCC 
GCCGCCCAAGTTGCCTCCTCGGCAGCTAAATCCCCAGGGGAGCGCGGCGCGGCAGCGACGTCG 
AGCGGCCTGACCGCGCAACCGCCCCAACCCCACTTCCGGCCCATCGCCGTTATCGGCCTCGCC 
GGCCGATTCCCCGCCGCACCCGACCTCGACGCCTTCCTTGAACTGCTCACGGAGGGTCGCTGC 
GGCATTCGCTTCTTCAGCCAAGCCGAGCTGCGCGACGAGGGTCTCGACGCGAATCGAATCGCG 
TGTCATAACTATGTCCCGGCCAAAGGTTTCCTCGACCGGGCCGACCACTTTGATGCCGACTTC 
TTCGGCATCCCGCCGCGCGACGCAGAAATCACCGATCCGCAAATTCGGCTTCTGCTTGAGTGC 
TGCTGGAACGCGCTGGAGCATGCCGGCTACCCGCCCGGCGGCGGCGAGATCGGGCTCTTCGCC 
GGCTCCTCGGCCAACTATCACTGGCTCGAATACGTGGGCATTTCCGAGGAGAGCAGCAATCGA 
TTCGCCGTCATGATTCAAAACGAAAAGGACTACCTGGCCACGCGGATCGCCTACCAGCTCGAT 
TTGAAGGGCATTGCCGTCACCGTGCAAACGGCCTGCTCGTCGTCGCTGACCGCGGTCGAGCTG 
GCCTGCGATGCGTTACACGCCGGCCGCGTGACCATGGCTTTGGCTGGTGGCGTTGGTCTGACC 
TATCCGTTGCGCGCCGGATACCTGCACGAGGATGGAATGATCTTCTCCCCCGACGGTCGGTGC 
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CGGGCCTTCGACGCCCAGGCGGCCGGCACGGTCTGCGGCAACGGTCTGGGCATGGTGGTGCTG 

AAACAGCTCGACGCGGCGCTGGCCGACGGCGATGCCATCCACGCTGTGATTAAGGGCATCGCG 

GCCAACAACGACGGCGCGGCCAAGATCGGCTACACGGCGCCCTCGCAGAACGGTCAGGCGCGG 

GTGATCCGCGCCGCCCATAGGCTCGCCCAAGTCGCGCCGGAGACCATCGGCTATGTAGAAGCC 

CACGGTTCGGGCACGCCGCTGGGCGATCCGATCGAGGTGGCGGGCCTGACCGAGGCCTTTGAC 

AGCCCGCGTCGCGGCTTCTGCGCCTTGGGTTCGGTCAAGTCGAATGTGGGTCATTTGGATGCG 

GCAGCGGGCATCGCGGGTTrCATCAAGGCGGTGCTCTCGCTGTCCCATCGGACCCTGTTCGCC 

AGCCTCCACGTCGACACGCCCAACCCGCAGATCCCGTTCGCCGACGGTCCGTTCCAGGTCAAC 

ACGGAGACCCGGCCCTGGCCAGCTGCCGACCATCCCCGCCGCGCCGGCGTCAGCTCCTTCGGC 

ATCGGCGGCACCAACGTGCACGCCGTCCTGGAAGAGGGGCCGCAGTTGGCCGAGCACGCGGGG 

CGGCGGCGCGAGCGGCAGCTGTTCCTGGTCTCGGCGCGGACTGCAGCCGATCTGGAGCGACGC' 

ACCGCGGCGCTGGTCCGCCACCTGGCCGCGCATCCGGACCTCGCACCAGATGACGTTGCCTTT 

ACCTTGCACGCGGGCCGCAAACCGATGACCCACCGTCGTTTCCTGGTCGCCGCCGACCTCGCG 

GAAGCCGCCGCGCGTCTGGCCGAGCCCGATCCAGTCAAATCCGCCGCGGCGCGCGCCGACCGC 

TGCCAGGTCTGGATGTTCGCCGGTCTCGGCTCTCAATACCCCGGCATGTGTGGCGGCCTCTAT 

CGCACCGAGCCGGCCTTTCGCGAGCAAGTCGACCGCTGTTTCGACCTCCTCGCGCCGCGTTGC 

GATTTGAAGCCCTCGCTCTTCCCCGAGCCCGATCAGGCCATCGACGCATCAGCCCTCGCGGCC 

ATCGACACCGCCCAGATCGCCGTCTTCGTCTGCGAATACGCGCTCGCACGGATGCTGGAAGGC 

TGGGGGCTGCGTCCGGATCGGCTGATCGGTTACAGTTTCGGCGAATACGTGGCCGCCTGCCTG 

GCCGGCGTCTTCTCCCTGCCCGACGCCTTGGCAATCGTCCGCGAGCGTGGCCGGATCCTGGCG 

GCGGCCGAGCCGGGCGCGATGGTCAGCGTGCCCCTTCCGGCCGAGCGCGTCGCGTCGCTGCTG 

GAGCCGCCGCTTGCCTTGGCCATTGACAACGGCCCCTCATGCGTGGTGTCCGGGCCGGTCGAA 

CCGGTGCGCACCTTCACCGCTCGCATGAAGCGGGACCGGGTCTGGGTGACGCCGCTCCAGGCC 

GAGCGCCCGATGCATTCGCCGCTGATGGCCGAGGCCGGCGGCTCACTGCGCGCCATGTTGGCC 

GGGTTCCGCCTGAATGCGCCGCGAATCCCGATCTTAAGCAATGTTACAGGAACCTACCTAACC 

GACGAGCAGGCCCGAGACCCCGATTACTGGGCCCGTCACCTGTGCGGCAACGTTCGCTTCGCC 

GACGGTGTGCGAACCTTGTTGGCCGAGCGCGATCCGGTGTTCCTTGAATTCGGGCCGGGCCGC 

GATCTGAGCTCCTTGGTGCGCCACCAGATGCCGGAAGGCGCCGACGAGCCGATCGCACTGATC 

CGTCATCGCGAAGATCCGGTGCGCGACGAAGACCTCCTGCTCGATGGCTTGGGCCGCTGCTTC 

CTGCGTGGGGCGACCCTCCACGGGCAGGCCTTGTACGCCGGCCGAGGCTGCCGCCGCGTGCCG 
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CTGCCCGGTTACCC6TTCCAGGGTCCACGCTGCATGCCGGCCCGCGCCGGACTGCCCGGCCTG 
GCGCGACCGACCGTGGGAGCGACCACCATCAGCTACCGACCAGCCTGGAAGCGGGCGCCGCGC 
TTGGCGGCTGTCGAATCGCTCGCGCCGCAATCCTGGTTGGTATTCAGCGACGGCAGCGAATTG 
GCGGGCGAGCTGGTGGCCGGCCTGCGCGCTTCCGGTTGCGCGACCACCCTCGTCGAAGGTGGG 
CTGGCGTTCGCGCGCTTCGCGGGCGGCTTCCGCGCGAATCCCCGCGAGGAACAAGATCTCGCA 
CAGCTGTTCGCGACCCTGTCGGCCGAAGCGATGCTGCCCACCCACATCCTGCACCTGCTCAGC 
CTGCCGTCGCCGGAGCGCGACTCGCCGCTGGCGCGCCTGGAGCACCTCACCGAGCTGGGCTTC 
CACCATCTGCTGGCCCTGGCCCGCCAACTGGAGGCGGTCGGCGCCCCCGAGGTCCGCCTCGCC 
GTGGTGACAACCGGCCTGGCGGCGATTGGCGGCGAGTCCGAGCTGCGGCCCGAGGTCGGGCTG 
TTGCGGGGACCTGTCCGCGTGATTCCCTTTGAATTCCCGAACTTGCGGCTGCGCCTGATCGAC 
CTCGACTCGGCCGATCCCATCTGGCGTAGCGGTTGTGAGCCGTTGCTGCGCGAAATGGGCGCT 
GCCCCGGGACCTGAAGAAATCGCGCTGCGCGGCACCAGCCGTTGGGAGTTGGGCTACGAGCCG 
GTCGAGGGGGGCACCGTGAGCACCATCTCCTCGCGACTGCGCGAGGGCGGCGTCTATCTGATC 
ACCGGTGGCCTCGGCGGCCTGGGTCTGGCCTTGGCCCGTCACCTCGCCCGGAAGTACCGCGCC 
ACCCTGATCCTCGCTGGCCGGCGAGGCGCGCCGGCGCGCGAGCTCTGGCACCAGGCGCCAGCG 
GAGTTCGTACCGGTCGCAGCTGCGATCGCACAGATGGAGGAGTGTGGCGCCCGCGTGATTCCC 
GTCGCGCTCGACGTCACCGACGCCGACCAAGTGAACGCGTTGTTCGCCACCATAGAAGCTACG 
GTCGGCAAGATTGAAGGCGTTTTCCACATGGCTGGCATCGTTGACGGCGGCATCATTCGAACG 
CGCACGCGCGCTGCCAGCGACGCCGTGCTGGCGCCCAAAACGGTCGGAACCTGGATTCTCGAT 
CGGGCTCTCCGCGGCGCCGGTGGCCGCTTCCTGGTGCTGTACTCCTCGATCAACGCGGTCGTC 
GCGCCCTTCGGCCAGGTTGCCTACGCCGCCGCCAACGCCTTCCTCGACGCCTTCGCCAGCGCC 
CACGAACACGACGAGCGTCTTTTCCGCGTCAGCATCGGTTGGGACACCTGGCGCGAGGCCGGC 
ATGGCCGTCGATGCCGCCCGCGCCCGCGGCGACCAGGCCCCGCTCGAAGGGCTTAGCGACGAG 
CAGGGCTTGCGCCTGCTCGAAAGCGCCTTGGTCGGTTGCGAACCGCGACTCCTCGTCTCCATC 
AGCGAACTGCGCGCTCGACTAGCCGAGCATCATCGCAACGGCGGCATTCCCCGGTTGCTCGGG 
CCCCGCGCCAACGAGGCGGGTGCAGCTGATTCCGGCGAGGAGGGCGCCACGCAAGACGCGTCG 
CCGGCCCGTCGCGCCCGTCCCGATCTGGTCGTGGCCTTCGCGCCGGCCGGCAACGAGCTGGAG 
CGCCGGATCGTGGCCATCATCGGCGCCTACCTGCGGCTCGGTCAGGTGGGCGTCGACGACAAC 
TTCAACGATTTGGGCGCCACCTCGCTCGACCTCATCCAGATCGCCCAACGCCTCGGTCGCGAG 
TTGGGCCGCGATGTCCCTGTCGTC7CGCTCTACCAACACCGCACCGTACGCGGGCTGAGCCGC 
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TTCCTCGGCGGCGCGCTCCAATCCGCGCGGTCCGGCGTCCCGACGGGCGCTGCCGCACCGGGC 
GCCGCCACGCCGGGGGTTGCCACCCCGCCGCGGCCACAACCGTCGCGCCAGCACCTGGAAAAA 
CGCCGTCAATTGAGGAAAAAAGGGGGGCCTTCCCATCATGAG 

(2) peptide sequence 

Seq ID No 44 ( >pEPOcos6_ORF5 . pep) 

MKVDKRNVDD I LGLTPTQTG I LYHYLLDPQADAYFEQLTLHLEGPLDVAR FRRAWERWAAHD 
QLRA VFRWQG I EHPVQ 1 1 L KQHVPDLELAEVPRDADPAAFLAQWVAADRARKFDFETVP FRIG 
LCRTDTQHHVMLLSNHHXLMIXSWSTGLILRDFIACYGDSE^WRPRTRTHFKAFIKWHQNRPRR 
GEERFWRDLLRDAPDGGFPRI^VEEGTRHSLDFGARSRALDDRLTQGLRDMARDLDVTIAAML 
HTAWGLLbQRYQNSCEVIFGTTVSGRNVELAGLDEWGLFINTIPFRFSAAAATTPVEAFRAV 
QRNLLARSEFEATPLVDIKGWSGLGPGAELFDTILVT ENYPLDRAIFESDSSLRLTDHQIFER 
TNYGLTLTIETFSRLHVTI^HRRDLLGDAAAERMLDHFTGLLQAMLRFPHQPFARLEMKSEHE 
AHRVLHQLNQTRQPLPSQSAFHQLFFEQAQADGARPALWCGATRWTYGQLLERALRLAGRLQE 
AGFARGDVAAVSLGPVPDLIPGLLGPLFAGGAYLPLDPTLPAQRSRFILDDAGCRFLISDAPL 
AGPTPIHPDPAGASPVDVIFACQDGAAQPAYLIYTSGSTGQPKGVWVSHRNLINFLTGMSAIL 
PVAADDVFLSLTTVSFDIFGLETWFPLSRGCTIVLGTRAEQLDPAAAAKAISCHGVTVYQATP 
SRLQLQLEHPTFVRAIGSLTTLLVGGEPLPAELLRRVREVTDARIFNLYGPTETTIWSTAGEV 
TAADVPDIGRPlANTGVFLLARDGSIQPPGLVGELCIAGEGVALGYHRRPDIiNRERFREIPPG 
RLPFAGKLYHTGDIARWTEDGRLI.CLGRLDDQLKVRGHRVEPGEIEAVMARHPAVTQAVVVTR 
PRl'JGEPVLVGFWTAEGEPMPEEALSAYLADRLPSYMVPERCILMKAMPLTGNGKIDRRALPNP 
FALTE S TRQAAPRTLARTAGEHRVAE LWQALLRREAI GLDE P F FQAGGNS FGLI RLHAKLES A 
FGKS FP ITDLFQHTS I RSQAEMLSGS SVEAPLAGAVPQ PPAAAAQVAS SAAKS PGERGAAATS 
SGLTAQPPQPHFRPIAVIGLAGRFPAAPDLDAFLELLTEGRCGIRFFSQAELRDEGLDANRIA 
CHNYVPAKGFLDRADHFDADFFGIPPRDAEITDPQIRLLLECCWNALEHAGYPPGGGEIGLFA 
GS SANYHWLEYVGI SEESSNRFAVMI QNEKDYLATRIAYQLDLKGIAVTVQTACSS SLTAVEL 
ACDALHAGRVTMALAGGVGLTYPLRAGYLHEDGM I FS PDGRCRAFDAQAAGTVCGNGLGMWL 
KQLDAALADGDAIHAVIKGIAANNDGAAKIGYTAPSQNGQARVIRAAHRLAQVAPETIGYVEA 
HGSGTPLGDPIEVAGLTEAFDSPRRGFCALGSVKSNVGHLDAAAGIAGFIKAVLSLSHRTLFA 
SLI-r/DTPNFQIPFADGPFQVNTETRPWPAADHPRRAGVSSFGIGGTNVHAVLEEAPQLAEHAG 
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RRRERQLFLVSARTAADLERRTAALVRHLAAHPDLAPDDVAFTLHAGRKPMTHRRFLVAADLA 
EAAARLAEPDPVKSAAARADRCQVWMFAGLGSQYPGMCGGLYRTEPAFREQVDRCFDLLAPRC 
DLKPSLFPEPDQAIDASALAAIDTAQIAVFVCEYALARMLEGWGLRPDRLIGYSFGEYVAACL 

AGVFSLPDALAIVRERGRILAAAEPGAMVSVPLPAERVASLLEPPLALAIDNGPSCWSGPVE 
PVRTFTARMKRDRVVWTPLQAERPMHSPLMAEAGGSL^ 

DEQARDPDYWARHLCGNVRFADGVRTLLAERDPVFLEFGPGRDLSSLVRHQMPEGADEPIALI 
RHREDPVRDEDLLLDGLGRCFLRGATLHGQALYAGRGCRRVPLPGYPFQGPRCMPARAGLPGL 
ARPTVGATTISYRPAWKRAPRLAAVESLAPQSWLVFSDGSELAGELVAGLRASGCATTLVEGG 
LAFARFAGGFRANPREEQDLAQLFATLSAEAMLPTHILHLLSLPSPERDSPLARLEHLTELGF 
HHLLALARQLEAVGAPEVRLAVVTTGLAAIGGESELRPEVGLLRGPVRVIPFEFPNLRLRLID 
LDSADPIWRSGCEPLLREMGAAPGPEEIALRGTSRWELGYEPVEGGTVSTISSRLREGGVYLI 
TGGLGGLGLALARHLARKYRATLILAGRRGAPARELWHQAPAEFVPVAAAIAQMEECGARVIP 
VALDVTDADQVNALFATI EATVGKI EGVFHMAG I VDGGI I RTRTRAASDAVLAPKTVGTW I LD 
RALRGAGGRFLVLYSS I NAVVAPFGQVAYAAANAFLDAFASAHEHDERLFRVS I GWDTWREAG 
MAVDAARARGDQAPLEGLSDEQGLRLLESALVGCEPRLLVSISELRARLAEHHRNGGIPRLLG 
PRANEAGAADSGEEGATQDASPARRARPDLWAFAPAGNELERRIVAIIGAYLRLGQVGVDDN 
FNDLGATSLDLIQIAQRLGRELGRDVPWSLYQHRTVRGLSRFLGGALQSARSGVPTGAAAPG 
AATPGVATPPRPQPSRQHLEKRRQLRKKGGPSHHE 

pEPOcos6_ORF6 sequences: 

(1) nucleotide sequence 

Seq ID No 45 ( >pEPOcos6_ORF6 . seq) 

ATGAGTGAAGTATCCATTCGCCCCGGCTTGGACATCGCGGTCATCGGCATGGCCTGCCGCTTT 
CCCGGTGCCCGCAACCTCGCCGAGTATTGGGCCAACCTGATCGAAGGCCTCGAAACGCTCAGC 
TTCTTCAGCGAAGAGGAGCTGCGTGAGGCCGGCTGCGATCCGGTCCAACTGGCCCAGCACAAC 
TACGTGCGCAGCAAGGGCCTGCTCCCTGACGCAGACCGTTTCGACGCCGATTTTTTTGGTTAT 
TCCCCGCGCGAAGCCCAGGTGATGGACCCCCAGATCCGCGTCTTCCACGAGGTCTGTTGGCAG 
GCGCTGGAGCACGCGGGCTACAACCCGCATCGCCACACCGGCACGATCGGCCTGTTCGCCGGC 
GCCGCGCCCAACGTTTTTTGGGAGTTTCTCTCCTATCGGTCCGATGCCGCCAATTTAGGCAAC 
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TTCACGCTGGGCCTGCACAACAACAAGGACTACCTGAGCTCGCGCATCGCCTACAACTTCAAC 
CTGACAGGGCCCAGCTACACCCTGTTCACCGCCTGCTCGACCTCGATGGTCGCCATCCACCAG 
GCCGTCCAGGCGCTGCTCAACGGCGAATGCGACCTGTGCATGGCCGGCTCGGTCTCCATTACG 
CTGCCACTGGTTGCCGGCTACACCTACACGCCGGGCATGATCGTCTCGCCCGACGGCCATTGC 
CGCACCTTCGACGCAGGCGCCAATGGCACTGTCTACGGCGACGGGGCCGGCGTGGTCGTTCTC 
AAGCGGGCCGAGGATGCGTTGGCCGACGGCGACCACATATTTGCGCTCATCAAGGGCTCGGCG 
CTCAACAACGATGGCAGTCGCAAGACCGGCTACACCGCGCCCAGCGTGCAGGGGCAGGTGGAG 
GTGATCCGCGCGGCGATGAACCTGGCGGAGGTCGAGCCGGAGGCGATCAGCTACGTGGAAACC 
CACGGGACGGGCACCACGGTGGGCGATCCGCTGGAGTTCGAGGCGCTAAAGGAGGCCTTCGGA 
GGTGGCTGCAAGGCCTTCTGTGGATTGGGTTCGGTCAAGCCGAACATCGGCCATCTGGACGTG 
ACGTCGGGGATCGCGAGCTTCATCAAGCTGGTCCTGGCGCTGGAGCACCGCATCCTACCGCCC 
ACGCTCCACTTCCAACTGCCCAACCCGAAGATGGATGTGGTCGATAGCCCCTTCTACATCGTG 
GCTGAGCGCGAACCCTGGCGCGAAGATCTGCTGCCGCGTCGGGCCGGTGTCAGCGCGTTCGGT 
CTGGGTGGCACCAACGTCCACATGATTTTGGAGGAGTTTCAGCGCGAACCGGCGGCGAACAGC 
GCGCGCACGCGCCACCTGACGGTGCTGACGGCGCGGTCGCCGCAAGCCCTGGCGCAGCTGGCG 
GCCAACCTCGCCGAACACCTGCGCGAACACCCCGAGTTGGCGCTGGCCGATGTGGCCCATACG 
CTGCTGCACGGCCGCAAGCCACATCCATTCGCGCGCATCCTGGTGGCGACCGATACGACGGCG 
GCGATCGACGCCTTGATGAACGACCGCGATCCGCGAACGCGTTTCTTCGAAGCGACCGGGCGC 
GGCGAGTCGGTGATCCTGTGTTTTGACGAAACGCCGCCGGAGCCGCGAAGCGCCCGCTACCTC 
TGGGATCACGAGCCGCTTTATCGCGCGGCGGCGACGTCGTGCTTGGCTGGTGAGGTCGCCGAC 
CCGGATCTGGAAGGCTGCTTTACTGCCCTGATCGCCGAGCAGGGCGCGGCAGCCGCCTTTTGC 
CACCAATACGCGCTGGCCGGATGGCTGCTGGCCATGGGGTTGACCCCGTCGGCGTTGATCGGC 
GTGGGCCAGGGCGAGTGGGTAGCAGCGGCGCTCGCGGAGGTGTTCCCGCCATCGGCCTGCTTG 
CGCTGGATTAGGTTCGGCGAACGGCTCCCGCAGCCGCGCGATCAACGGATTCCGTTTCTCTCC 
AATTTCTCTGGAAACTGGATCGTTGGGCGTGAGTTGGCCGACCCGGATTACCCCAGAAAGCAG 
AAGGGTAAGCGCTGCATGAAGCGCCGTCGGTCCCAACCTCGGTCAGCTGGTGCAGGATGGGGG 
CGATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACG 
GTGATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAG 
TACCTGGGGGCGAGCTCGAGG 
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(2) peptide sequence 

Seq ID No 46 ( >pEPOcos6_ORF6 . pep) 

MACR F PGARNLAEYWANL I EGLETLS F ? SEE ELREAGCD PVQLAQHNYVRTKGJjL PD ADRFDA 

DFFGYSPREAQVMDPQIRVFHEVCWQALEHAGYNPHRHTGTIGLFAGAAPNVFWEFLSYRSDA 

ANLGNFTLGLHNNKDYLSSRIAYNFNLTGPSYTLFTACSTSMVAIHQAVQALLNGECDLCMAG 

SVSITLPLVAGYTYTPGMIVSPDGHCRTFDAGANGTVYGDGAGVVVLKRAEDAIADGDHIF 

I XGSALNNDGSRKTGYTAPSVQGQ VEVIRAAMNLAEVEPEAI SYVETHGTGTTVGDPLEFEAL 

KEAFGGGCKAFCGLGSVKPNIGHLDVTSGIASFIKLVLALEHRILPPTLHFQLPNPKMDVVDS 

PFYIVAERE PWREDLL PRRAGVS AFGLGGTNVEMI LEE FQRE PAANS ARTRHLTVLTARS PQA 

LAQLAANLAEHLREHPEIALADVAiiTLLHGRKPHPFARILVATDTTAAIDAL 

EATGRGESVILCFDETPPEPRSARYLWDHEPLYRAAATSCLAGEVADPDLEGCFTALIAEQGA 

AAAFCHQYA1AGWLLAMGLTPSALIGVGQGEWVAAALAEVFPPSACLRWIRFGERLPQPRDQR 

IPFLSNFSGNWIVGRELADPDYPRKQKGKRCMKRRRSQPRSAGAGWGRWNRLGQLVARCSSAG 
S GGGTV I G PRARF I S S S T S RAR VRAQ YLGAS S R 

pEPOcos 6_ORF7 sequence s : 

(1) nucleotide sequence 

Seq ID No 4 7 ( >pEP0cosS_0RF7 . seq) 

ATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACGGT 
GATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAGTA 
CCTGGGGGCGAGCTCGAGGTAGCGGTCCCGCGGCCAGTAGGGCATCGCGCGAATGACGTCGGC 
CAGGTAGGCCTCCGGGTCGAGCCCGTGCAGCTTGCAGCTCGCCACGAGCGAGAAGAGGTTGGC 
CGCGGCGGAGGCGTGGTCGTCGCTGCCGAAGAAGAGCCAGGACTTTCTCGCAACCGCAATGGA 
TCGCAGCGCTCGCTCGCTGGCGTTGTTCTCCAGGCGCAGCCGACCGTCGTCGAGGAAGCGCCG 
CAACGGCTGCTCTTGGTTGAGGGCGTAGCCGAGCGCGGTGGAGACCAGGCCGCGCTCGCGGGG 
ACGAGCGTGCTCGGCCCTGGCCCAGGCAAAGAACGCGTCGACCAGAGGGCGGACGACGACATC 
GCGACGCACCTTGCGCTGCGCGGGCGGCAGGTCCGCCAGCGCGCGATCGGCGGCAAAGAGGGC 
GTTGATGCGCCGCAGCCCCTCGACACCGAGCTCGTGCTTGCAGACCGCCGCCTCCCAGAAGTT 
GGTACGGCAATGCGACCAGCATCCGACTTCGGTCGGGGGCGGACCGCGCTTCTCGTCGGCAGC 
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AGCGCCTCTTGGTGGTGTGCCGCGGAAGAGGGCGTCATAGATGGCGTGAGCGTCAGCTTGAAT 
ATACCGAGAGAAGCCGCGGAACATCTCGCAGACCGCGGCGCTGGTATGCTTGGGCTGGTACTC 
GAAGAAGACGTGATCCTTGTCCGCGAGGACGACGAAGAAGTGTCCCTTGCGGCACGGCCCGGG 
CTTCTTGTCCTTGCGCTCCTGGATGGGCCCAGGCTGGACGGAGACCCCGGTGGCGTCCGTGGA 
CAGGCAGAAGGCGGTCTCGAAGGCCTCTTTGCGCGCGGCCTCGACGATGGCGCCCAGGGTCGC 
ACCGACGTCTTCGGCGTAGCGGCACATCGTGCCGCGATCGAGCGACGCGCCCTGAAGCTCCAG 
CTGCTGCTCCAGTCGATAGAACGGGACGCCGAGCAGGTACTTGCTGGTGAGGATGTGCGCAAT 
CATCGACGGCGCGAGGAACGACCGCCGGAACAACTCCTTCGGAAGCGGCGTCGTGATGAAGAC 
CGTGCAGGTCTCGCCCTTCGGCGCCGGCGGCGGCGCGTCGAGCGAGGGCGCGTCGAGCGCTGT 
GGAGGAAGCGCTCGGCTCGCCGGCCGCTGCCGTGTCCTCCGGGCTGACGCTCGCAGCGGGCGT 
CGGCGTCGGGGCTTCTCTCGCGACGACCTGGAGCGGGGCCGCTTCCTCCTCGCCCGAACTGCT 
CGCATCCGTGACGGACCGCTCGGCCTTGTACACGACGCGTGCGAGCACGATGCGGCGCATTCC 
GCCGCGCTCGTAGCCGAGTCGCGAGGTCTCCTCGACCCCGATGCGCGTCGCCGTCGCATCGAG 
CTCGGGGCAGGAGAGCTCGATGCGGACGACGGGCAGGTCGGACTCGGACAGGTCGCGACGGCC 
CTTGCCGCCGGACCTTCGTTTCGGCCCCTTGGGGTCGTCGTGCTGCCGCTCGTCGCCTGTATT 
GCGCTCGGCGGCGTCGAGTGCCTTCGCGAGGCGCTGGACCTCGAGGAACATCGAGTCGAACGC 
CAGCTGCTCCGCGCTCACCTCGGCGCGCTCCGCCTTGGCCACGAACAGTCGACGTCGCAGAAG 
CTGCAGCTGCTCGAGCGGACGGGTGTAGGCGCGCCGAAGCTGCGCGAGCGCATCGCGCGCTCC 
CACGAGCTCGCTCTTTGCCGCGGCGAGCTCCGCTTCGAGCTGCGCGATGCGCTGCTGCTCGGC 
CGAGAGCGTCGGCTTGGCGGCGGCGTCGTGCACGACGCCGCTCTACGTAAGCCGCGCGTACTT 
GTCGAGCGAATTCGTGCGGCTCAGTGGACGCGGCGCGGTGCGCGCCTTCGCGGTTTGGACGTG 
GGCGCGATCTCGATGCCGTCGAGCAGCGTCTCGAGCGTGGCGTCGTCCACCTCGACGTGCGTG 
GCGCCCTCGGTCGGGGGGTCGGGAAGTGCGAACGCTCCGCGATCAAGGCGTTTTGAAAACAGG 
CAGATTCCACTGCCATCGAAGAAGAGAATCTTGATCGTGGTCCGCCGCTTGCCGACGAACGCG 
AACAGCGCTCCGCAGCGAGCCTCGTACCCCACACGCTCACGGATGAGACCCGAAAGCCGCTCG 
AAGCCG 
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(2) peptide sequence 

Seq ID No 48 (>pEPOcos6_ORF7 . pep) 

MEPARSARRALLFRGKRRRDGDRPEGEVHLVVDEPGAGARPVPGGELEVAVPRPVGHRANDVG 
QVGLRVEPVQLAARHEREEVGRGGGVWAAEEEPGLSRNRNGSQRSLAGWLQAQPTWEEAP 
QRLLLVEGVAERGGIX?AAIiAGTSVLGPGPGKERVDQRADDDIATHLALRGRQVRQRAIGGKEG 
VDAPQPLDTELVLADRRLPEVGTAMRPASDFGRGRTALLVGSSASWWCAAEEGVIDGVSVSLN 
IPREAAEHLADRGAGMLGLVLEEDVILVREDDEEVSLAARPGLLVLALLDGPRLDGDPGGVRG 
QAEGGLEGLFARGLDDGAQGRTDVFGVAAHRAAI ERRALKLQLLLQS I ERDAEQVLAGEDVRN 

HRRREERPPEQLLRKRRRDEDRAGLALRRRRRRVERGRVERCGGSARLAGRCRVLRADARSGR 
RRRGFSRDDLERGRFLLARTARIRDGPLGLVHDACEHDAAHSAALVAESRGLLDPDARRRRIE 
LGAGELDADDGQVGLGQVATALAAGPSFRPLGVWLPLVACIALGGVECLREALDLEEHRVER 
QLLRAHLGALRLGHEQSTSQKLQLLERTGVGAPKLRERIARSHELALCRGELRFELRDALLLG 
RERRLGGGWHDAALRKPRVLVERIRAAQWTRRGARLRGLDVGAISMPSSSVSSVASSTSTCV 
APS VGGSGSANAPRSRRFENRQ I PLPS KKRI L I WRRLPTNANS APQRAS YPTRSRMRPESRS 
KP 

pEPOcos6_ORF7 . 1 sequences : 

(1) nucleotide sequence 

Seq ID No 4 9 { >pEPOcos6_ORF7 . 1 . seq) 

ATGTTCCTCGAGGTCCAGCGCCTCGCGAAGGCACTCGACGCCGCCGAGCGCAATACAGGCGAC 
GAGCGGCAGCACGACGACCCCAAGGGGCCGAAACGAAGGTCCGGCGGCAAGGGCCGTCGCGAC 
CTGTCCGAGTCCGACCTGCCCGTCGTCCGCATCGAGCTCTCCTGCCCCGAGCTCGATGCGACG 
GCGACGCGCATCGGGGTCGAGGAGACCTCGCGACTCGGCTACGAGCGCGGCGGAATGCGCCGC 
ATCGTGCTCGCACGCGTCGTGTACAAGGCCGAGCGGTCCGTCACGGATGCGAGCAGTTCGGGC 
GAGGAGGAAGCGGCCCCGCTCCAGGTCGTCGCGAGAGAAGCCCCGACGCCGACGCCCGCTGCG 
AGCGTCAGCCCGGAGGACACGGCAGCGGCCGGCGAGCCGAGCGCTTCCTCCACAGCGCTCGAC 
GCGCCCTCGCTCGACGCGCCGCCGCCGGCGCCGAAGGGCGAGACCTGCACGGTCTTCATCACG 
ACGCCGCTTCCGAAGGAGTTGTTCCGGCGGTCGTTCCTCGCGCCGTCGATGATTGCGCACATC 
CTCACCAGCAAGTACCTGCTCGGCGTCCCGTTCTATCGACTGGAGCAGCAGCTGGAGCTTCAG 
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GGCGCGTCGCTCGATCGCGGCACGATGTGCCGCTACGCCGAAGACGTCGGTGCGACCCTGGGC 
GCCATCGTCGAGGCCGCGCGCAAAGAGGCCTTCGAGACCGCCTTCTGCCTGTCCACGGACGCC 
ACCGGGGTCTCCGTCCAGCCTGGGCCCATCCAGGAGCGCAAGGACAAGAAGCCCGGGCCGTGC 
CGCAAGGGACACTTCTTCGTCGTCCTCGCGGACAAGGATCACGTCTTCTTCGAGTACCAGCCC 
AAGCATACCAGCGCCGCGGTCTGCGAGATGTTCCGCGGCTTCTCTCGGTATATTCAAGCTGAC 
GCTCACGCCATCTATGACGCCCTCTTCCGCGGCACACCACCAAGAGGCGCTGCTGCCGACGAG 
AAGCGCGGTCCGCCCCCGACCGAAGTCGGATGCTGGTCGCATTGCCGTACCAACTTCTGGGAG 
GCGGCGGTCTGCAAGCACGAGCTCGGTGTCGAGGGGCTGCGGCGCATCAACGCCCTCTTTGCC 
GCCGATCGCGCGCTGGCGGACCTGCCGCCCGCGCAGCGCAAGGTGCGTCGCGATGTCGTCGTC 
CGCCCTCTGGTCGACGCGTTCTTTGCCTGGGCCAGGGCCGAGCACGCTCGTCCCCGCGAGCGC 
GGCCTGGTCTCCACCGCGCTCGGCTACGCCCTCAACCAAGAGCAGCCGTTGCGGCGCTTCCTC 
GACGACGGTCGGCTGCGCCTGGAGAACAACGCCAGCGAGCGAGCGCTGCGATCCATTGCGGTT 
GCGAGAAAGTCCTGGCTCTTCTTCGGCAGCGACGACCACGCCTCCGCCGCGGCCAACCTCTTC 
TCGCTCGTGGCGAGCTGCAAGCTGCACGGGCTCGACCCGGAGGCCTACCTGGCCGACGTCATT 
CGCGCGATGCCCTACTGGCCGCGGGACCGCTACCTCGAGCTCGCCCCCAGGTACTGGGCGCGC 
ACCCGCGCCCGGCTCGTCGACGACGAGATGAACCTCGCCCTCGGGCCGATCACCGTCCCGCCT 
CCGCTTCCCGCGGAAGAGCAGCGCGCGACGAGC 

(2) peptide sequence 

Seq ID No 50 ( >pEPOcos6_ORF7 . 1 . pep) 

MFLEVQRLAKALDAAERNTGDERQHDDPKGPKRRSGGKGRRDLSESDLPVVRIELSCPELDAT 
ATRIGVEETSRLGYERGGMRRIVLARWYKAERSVTDASSSGEEEAAPLQWAREAPTPTPAA 
SVSPEDTAAAGEPSASSTALDAPSLDAPPPAPKGETCTVFITTPLPKELFRRSFLAPSMIAHI 
LTS KYLLGVPF YRLEQQLELQGAS LDRGTMCRYAEDVGATLGAI VEAARKEAFETAFCLSTDA 
TGVSVQPGPIQERKDKKPGPCRKGHFFWLADKDHVFFEYQPKHTSAAVCEMFRGFSRYIQAD 
AHAI YDALFRGTP PRGAAADEKRG? P PTEVGCWSHCRTNFWEAAVCKHELGVEGLRR I NALFA 
ADRALADLPPAQRKVRRDVWRPLVDAFFAWARAEHARPRERGLVSTALGYALNQEQPLRRFL 
DDGRLRLENNASERALRSIAVARKSWLFFGSDDHASAAANLFSLVASCKLHGLDPEAYLADVI 
RAM P YWPRDRYLELAPR YWARTRARL VDDEMNLALGP I TVP PPLP AEEQRATS 
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pEPOcos6_ORF7 .2 sequences: 

(1) nucleotide sequence 

Seq ID No 51 (>pEPOcos6_ORF7 . 2 . seq) 

ATGATTCCGGCGGGCGTGCAGGTGTTCGTCGCGCTGGAGCCGGTGGACATGCGCTACGGCTTC 
GAGCGGCTTTCGGGTCTCATCCGTGAGCGTGTGGGGTACGAGGCTCGCTGCGGAGCGCTGTTC 
GCGTTCGTCGGCAAGCGGCGGACCACGATCMGATTCTCTTCTTCGATGGCAGTGGAATCTGC 
CTGTTTTCAAAACGCCTTGATCGCGGAGCGTTCGCACTTCCCGACCCCCCGACCGAGGGCGCC 
ACGCACGTCGAGGTGGACGACGCCACGCTCGAGACGCTGCTCGACGGCATCGAGATCGCGCCC 
ACGTCCAAACCGCGAAGGCGCGCACCGCGCCGCGTCCAC 

(2) peptide sequence 

Seq ID No 52 ( >pEPOcos6_ORF7 . 2 . pep) 

MIPAGVQVFVALEPVDMRYGFERLSGLIRERVGYEARCGALFAFVGKRRTTIKILFFDGSGIC 
LFSKRLDRGAFALPDPPTEGATKVEVDDATLETLLDGIEIAPTSKPRRRAPRRVH 

pEPOcos6__ORF7 . 3 sequences: 

(1) nucleotide sequence 

Seq ID No 53 ( >pEPOcos6_ORF7 . 3 . seq) 

ATGACAAGGACGAAGGCGACCGAAGTGATGTGGTCCGAGCGCGTTCGGGCGTGGCGCGAGAGT 
GGTGAAACGGCGGAGGAGTTCGCTCGGAGCCGCGGATTTGCGGCCTCGACGCTGCACGGCTGG 
TCGAGCCGGCTGTCGCGGGCCGAGCCACCGCGCTTTCTGCGCCTGGTGCCGAAGGCGCCCGCC 
GTGACGAGCAGCGCTGCGGAGCTCGTCGTCGAGGTCGGCGGCGCGCGGGTGCGCGTCGCCGCG 
GGGTTCGACCCCGCGCTGCTGGCGGAGGTGGTCCGTGCCCTCGGCGGAGCGGGGCGA 

(2) peptide sequence 

Seq ID No 54 ( >pEPOcos6_ORF7 . 3 .pep) 

MTRTKATEVMWSERVRAWRESGETAEEFARSRGFAASTLHGWSSRLSRAEPPRFLRLVPKAPA 
VTS SAAEL WEVGGARVRVAAGFDPALLAE WRALGGAGR 
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pEPOcos6_ORF8 sequences: 

(1) nucleotide sequence 

Seq ID No 55 ( >pEPOcos6_ORF8 . seq) 

ACTGGACAGCGCAGCCGGGGTGAGACGGCGCTTCGCGCAGCGCTTACGCAGAAGGCGCGCCGC 
GCGCCATTGTCGGATGCGGTGCGCGACTTCGCCGCCGATCGGCTGTTGCTGGAACTGGGACAA 
CCACTGGACGTAACGGCTGAAGCGAGCCAACGGCTCCAGCTCGCGCGGGGCGACCTGTTCGGC 
GCCTACCAAGCGTTGGvCCAGCTCTGGATCTGCGGCGCCCTGGCCGAACCGCCGCGACTGTAT 
CCCGACGAACACCGCCGGCGCGTGCCGCTGCCGAGCTACCCCTTCGAGGGAAAGCGGTTCTGG 
ATCGAGGGCTCGCCGTTCGAAACCGCGCCCGCCGCCGGCGCCTCACCCCAACCCGCCGATTCG 
GGGGACATTCTCAAGGGCGACCCGGCGGACTGGTACTATCGGCCGCGTTTCGAAGCGGCGCCG 
CTCTTGCCCAGCCCGTTCGAGAGCGAACCCGGCGATTGGCTGGTGTTCGAAGATGAGCTGGGG 
CTCGGCGCCTGGCTGAGCGAGACCTTGCGCGACAAGGGCGCGCGGGTCGCGACAGTCGTTCGA 
GGCACCGAGTTCCGACGCCTGGCGTCACAGCGCTTCCAGCTTCGTCCCGATCGACGGGACGAT 
TACCGGACCCTGCTGCACGAGTTGAAGGCGCAGGGCATCGCGCCGGTCCACCTGTGCCACCTA 
TGGAGCGTGACCGCCGCACCGGATGCCGAGCAGTTGCTCGACGTCAGCTTTCACAGCCTGGTC 
CATTTGGCGGCCGCTTTGGGTTCGGTTGGCTACTTCCACGCCATG 

(2) peptide sequence 

Seq ID No 56 ( >pEPOcos6_ORF8 . pep) 

TGQRSRGETALRAALTQKARRAPLSDAVRDFAADRLLLELGQPLDVTAEASQRLQLARGDLFG 
AYQALAQLWICGALAEPPRLYPDEHRRRVPLPSYPFEGKRFWIEGSPFETAPAAGASPQPADS 
GDILKGDPADWYYRPRFEAAPLLPSPFESEPGDWLVFEDELGL.GAWLSETLRDKGARVATWR 
GTEFRRLASQRFQLRPDRRDDYRTLLHELKAQGIAPVHLCHLWSVTAAPDAEQLLDVSFHSLV 
HLAAALGSVGYFHAM 
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pEPOcos6_ORF9 sequences: 

(1) nucleotide sequence 

Seq ID No 57 { >pEPOcos6_ORF9 . seq) 

ATGAAGTTGAACGTGGTCGCCAACCGGCTATTCGACCCCGAGTCGCCCGAGCGCACCGAGCCC 
GCCAAGAGTCTGTTGCTCGCGGTGACCAAAGTCCTGCCGCAAGAGGTGCCCAACGTTCGAACC 
CGCGCCATCAGCGTGGACCTGGATCGCTCGTTCGACGCGGCGGCGCCCGCCTGGGCCGCCAGT 
TTGTTGGTTGAATGCGGCGCGCCCGTCGAGGAAACGGTGGTGACCTACCATGGCGCAGCCCGA 
TGGCTGCGCCGCTTCGATCGCGTTGCGGTGAATGGTCTCGGCCCGTTCCACCCCGATCAACCT 
GCGCCGCTGCTGCGCGAGCGCGGCG7GTACCTGATCACCGGCGGCCTGGGCGGCGTGGCTGGC 
CAGTTGGCGCGCTACCTGGCGCGGGCCTGCCGGGCGCGGTTGGTGCTCACCGCGCGCCGGCCC 
CTGCCCGAGCGCGACCAGTGGGATCGGGAGTCGGCCGTGCTGTCATGGGACGACAAGACGCGC 
CAGCGCATCGAGCTGGTGCGCGAGCTGGAGCGGCTGGGGGCCGAAGTATTGGTGGTGGCTGCC 
GATGTCGCCGACGAAGCGGCCATGGCGCAGGCGATCGAGGCCTCACTGGCGCGATTCGACGCT 
TTGGACGGCTTGATCCACGGCGCCGGGATCGTGCGGGTCGCGTCGGGCCGCACGCCGATCGGG 
AGTATGACGCGGGCCATGTGCGAGGAGCAGCTCCGCCCCAAGATGTTGGGCCTCGACGTCGTC 
GACCGCCTCCTGCGCGATCGCCGGTTGGACTTCCGCATTGCCATCTCGTCGCTCGCCCCGATT 
CTCGGCGGCCTCGGCCACGTCGCCTACGCCGCCGCCAACCTCTACATGGACGCGTTCGCGACG 
CGCGCCGCCGCCGGCAACGCGCCTTGGATCGCGCTGAACCTGGCCGAGTGGGAATACGAGGGC 
CCGGCTACCTACGACGAGCGGGTGGGCCGTTCGCTCAAGCAGCTC3AGCTCACCAACGAGGAG 
GGTATCCGCGTCTTCCAGACGGTGTTGGCCTTGGCCGCGCGCGGCCCGCTACAGCAGATCATT 
ATTTCCACCGGCGACCTCCAGGCCCGCCTCGACAAATGGATTCACATCAAATCCCTGCATCGC 
CGACCGGGGCCGGTCCAGCTCAGTCGCCGGACCGCGGCACCCCAGGGCGGTTTCGGCTCGGAG 
CGCGCCGCCTTCGAGGCCGCCTTCGCTGACGCCTGGTGCGACTTCTTCGGGGTTGAAGAGGTC 
GACCCGAACAAAAACTTCTTCGATCTGGGCGCCAGCTCGCTCGACTTCATCCACCTCGTCAGT 
CGCTTCAGCAAGGCCATCGAACAGCATGTACCGCTCGAGGCCCTGCTCGAACACTCCACCCTG 
CACGACCTCGCCGCCCACCTCGCGGGCGACGCGAACACCGACGCCAGCGACGAAGCGCGCATT 
CGCCAACGGCTGCAAGGCGCCAAGTCCGGCGACATCGCCATCATCGGCATGGCCGGCCGCTTC 
CCGCTCGCGCCCGACCTGGACACCTATTGGCGCAACCTGGTCGGAGGCATCGACGCGGTCAGC 
TTCTTCAGCGCCGAGGAGTTGCGTGCTGCTGGCGTCACCGCGGCCGAGATCCACCACACCAAC 
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TACGTGCCGGCCAAGGGGCGCTGC3CCGACCAGGACTTGTTCGATGCGGCCTTCTTCGAATAC 

ACTGCCAGCGACGCCGAGCTGATGGACCCGCAAAATCGCGTGTTACACGAGGTCGTGTGGCAC 

GCGCTGGAAGACGCCTGTTTCGACTTCAACGGCGATCACGGCCAGGTCGGCCTGTTCGCGGGC 

GCCTCGCCGAACCTGTGGTGGCAGTTCGTGGCCAGCTTTTCCGAGGCCGCCAAGACGCAGGGC 

ATGTTCACCACCACCCTGCTCAACGACAAGGACTCGATCGCGACCCAGATTTCATACAAGCTC 

GGTCTAAAGGGCCCCGCGGTCACCTTGTTCACCGGCTGTTCCACCTCGCTGGTAGCCGTTGAC 

GCCGCCTGCCGCTCGATCTGGTCCGGTCAATCGGACATGGCCGTGGCCGGCGCGGTCTCGCTG 

ACTCTCCCCGATAAGGCCGGCTACATCTACGAAAAGGGCATGCTCTTCTCGGCCGACGGCCAT 

TGCCGGGCTTTCGACGCCAACGCCACCGGCATGGTCTTCGGCGACGGCGCCGGCGCGATCGTG 

CTCAAGCCGTTGGACGCGGCCCTC-CGCGACGGCGACCCGATCCATGCGGTGATCAAGGGCTGC 

GCCACCAACAACGACGGCGACCGCAAAGCCGGCTACACGAGCGTCAGCGCCCAAGGCCAGGCC 

GAGGTGATCCGCTCGGCCCAGATCCTGGCCGACGTGGCGCCCGAATCCATCAGCTACGTGGAA 

GCCCACGGTACCGGCACCAAGTTGGGCGACTCGATCGAGATCAAGGCGTTGAAGCAAGCCTTC 

GCCAGCGACAAGAACGGATTTTGCGGCATCGGGTCGGTCAAGACCAACCTCGGTCACCTGATG 

GCGGCGGCGGGGATGGCCGGCCTGATCAAGACGGTTCTGGCGATGAAGCACCGCCAATTGCCG 

CCATCGCTGCACTGCGACGAAGTGAACCCCGACCTGGAGTTGGAGCGCAGTCCGTTCTACATC 

AACACCCGCCTGCGCGACTGGGTTGCACCGGGCGGGCCGCTGCGGGCCGGCGTGAGTTCGTTC 

GGGATCGGCGGAACCAACGCTCACGTCATCCTGGAGGAGCCGCCGACGCGCGAGAGCGGCACG 

CGCATGCGCCACTGGAAATTATTGATGCTGTCGGCGGCCAGCGAGGCGGCGCTCGACCGCCAG 

GCCGATAACCTGGCCGACTACCTGGAGCGCCATCCCGAGGCCCACCTCAGCGACGTGGCCTAT 

TCCCTCCAGACCGGCCGGCGCGTTCTGGCCTGGCGGCGCACGGTCCTATGCGAGTACCGCGAG 

GACGCGGTGACCAGTCTGCGCGAGCGACAGGCCAAGCGCGTCCAGACAAGTCGCGTCCGCTGG 

GACCACAAGGACGTGGTCTTCATGTTTCCCGGTCAGGGCGCCCAGTACCTCAACATGGGCCGC 

GACTTATACGTCATGGAGCCGGTCTTCCGCGAGGTCATGGACCGCTGCTTCGAGTTGCTGGCC 

CCTTTGTGGTCCGAGCATCCGCGCCAGATCCTTTATCCGGAGGGCGGGGTGTCGACCCTGCTC 

CACCGGACTGATTACACCCAGCCGATCGTGTTCTGCTTCGAGTACGCCCTCGCCCATTTGCTG 

CTCTCCTGGGGATTGAAGCCGGCCGCGACCATCGGCTACAGCTTCGGCGAGTACGTTTCTGCC 

TGCCTCGCCGGCGTCTTCTCCCTGGAAGATGCGATCCGTCTGGTGACCGAGCGCGGTCGGCTG 

ATGGCGGCTTTGCCCGCGGGCGCCATGCTCAGCGTCCCGGTTCCCGAATGCGAGCTGCTGCGG 

CTGCTGGACGGCTTCCACGCCCAATCGGCGGCCCATCTGGCGCTGGCCGTCGACAATGGCGCC 
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TCCTGCATTGTGGCCGGCGAGCAGGCCGCCATCTCGGCCTTCGAATCGATGCTTCGCAAGAAG 

CGTCTGTTGACCATGCGGGTCGCGGTCAGCCACGCCGCTCATTCGCAGGTCATGACCGGCGCG 

ACCGACGCCCTGCGCAGCATCCTGCGGAAGATCCCCC7CTCCGCGCCGACAATTCCCTTCATT 

TCCTGCGTCACCGGCACCTGGATCACTGCACAGCAGGCTACGGATCGCGAGTATTGGGTGAAC 

CACATGTGCGGGACGGTGCGGTTCGCGGCGGGTCTGACCGAGCTGGGTCAAAACCGCGAGGCG 

GTGTTCCTGGAAGTAGGTCCGGGCCGCGACTTGACGTTGCTGGCCCACCGCATCCTGGCCGAC 

AGCGCGGCCGTGTTCGAGCTGGTCAAGGCGCCCGACGGCGGCGACGACGATGGGTTCCTCCTG 

CTGGATCGArrGGCCAAGCTCTGGAGGCTGGGGATTTCGATTGACTGGGCCGGCTTCTACGCG 

GATGAGCGGCGGCGGAAACTCTCGCTGCCGGGATATCCGTTCGAGCGGCGGCGCTTCTGGATC 

GAGGGCAACCCGCTGGAGATCGCCGCCGGCAGGCCCAATGTCCAGGGGCCGCTGGTCAAGGCG 

TCGGACATCGGCGCTTGGTTCTACGTGCCGCAATGGCGGCGGTCGGTGCTCGCCGAGCCGGGT 

ACAACGGCGGCGGGCGCCGCCGTCACGGCGGAGCAGGCACGCGTCGTGACCGAGCTACGGGCG 

GGATGCGCGTCGGCCGGCTTGGGCAGCGGGGCCTGCGGACTGAATGGCGGTGCCCCGTCCGAG 

CGTCCGAAAGAAAGTGTAGCGCCAGCCGGGTCGACCAGCGCAGCGGCGCAGACCGGCGCGGAC 

TGCCCGACACCGACTGGGGAGCCAGCGGCTGTGCCAAAGGACGGGGCCGAGCCGCGGCCGACC 

TGGCTTATTTTCGCCGACGCCGGCGGATTGGCCGAATCTTTCGCCAAGCGGGTTCAGGCCCGC 

GGCGAGAAGCTTTACCTGGTGGCTTCCGGCTCGCGCTTCGAGCGCCTGGCCGAGACCCGCTTC 

CGCCTCGATCCCGGGGCCAAGTCCGATCACCGCCTGCTTTTCAAGGCGCTCGACGAGGCCGAC 

ATCCTGCCGACCCACCTCCTCGACTTCCGCTCGCTTGACTGCGGCGGGCCCGACGCCGACCCC 

ATGGACCAGGCCGGCTTCTTCGGGCTGTTGCACCTGGTCCAGGCGATGGCAGAGGCCGGCTAC 

AGCCATCCCATTCGGCTGCTGATCGTCAGTTGCGGCGTCTACGATGTCACCGGTGCCGAACCG 

CTGCAGCCGGCGCGGGCCACGATGATCGGACCGGCTCTGTGCATCCCGCAACAGTATCCGCAC 

CTCGAAACGAGCCATGTGGATTTGGGCGTGGTCCATGCCGACGAGCTCCACGCCGCGCGCCAG 

CTCGACAGCCTACTTGCCGAATGCCTAAGTGCAACGGCCGAGCGCCAATTGGCGCTGCGCGGC 

CGACACCGCTGGCTGCTGGACTACGAGCCAGTCCGCTTGCCGCCGCTCGACCCGGGCCGTCTG 

CCCTGGCGCCAGCGCGGGGTCTACTTGATCACCGGCGGTTTGGGCGGGATCGGCCGCATCCTG 

GCCGAACACCTGGCCCGCACGACCTCGGCTCGCCTGGTCCTAATCGGCCGCGAAACCCTGCCC 

GACCGCGACGACTGGGACGCCTGGCTGAACCGCCCGCAACCGGTCGACGCCACCCACGAACGG 

CTGCTGCACAAGATCCGCGCGATTCGCGATCTGGAAGCGCTAGGCGCCGAAGTCCTGGTCCTC 

GCC3CCGACGTCGCCAACGAAGCCGCCATGCGCGAGGCCTACGATCGCGCCGAATCCCACTTC 
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GGCACAATCCACGGGGTGATTCACGGCGCCGGCCTGATGGACGCGCAAAGCTTCTCACTGATC 
GACGCCCTCGACCACGACCTCTGCGCCCGCCAGTTCGAAGCAAAAATCCGCGGCGTCTGCGTG 
CTCGACCGCGTTCTGGCCGACCGCACGCTCGACTTCTGTCTGCTGATGTCTTCCATCTCCACC 
GTGCTCGGCGGCCTGGGCTATTTCGGTTACGCCGCGGCCAACGCCTTCCTCGACGCCTTCGCC 
CAGGCGCGCAGCCGCGACGCCGCTTTCCCCTGGCTTAGCGTGGCCTGGAGCGATTGGAAGTAC 
TGGACCGAGCGCAAGATGGACAACGAGGTCGGCGCCGTCATCGACAGCCTCTCGATGGAACCC 
GCCGAGGGCTTCGAAGCCGTCACCCGCGTCTTGGCTTGGGGCAAGGCGCCCCACATCGCCAAC 
TCGCCCGGTGACCTCGGTCGCCGCCGGGATCAATGGGTCAAACTGGCCAGCCTGAAATCGGCG 
CACTCCAGCGAGCCCGAGCCGGCTAGGCATGGACGTCCGGCGCTCTCCAGCGAATGGGTCGCG 
CCGCGCAACGTGGTCGAAGAGAAGCTGGTCGCCATTTTCGAGCAGGTGTTCGGCACTGCGGCA 
CTGGGCATCGAGGACAACTTCTTTGAGTTGCGCGGCGACTCGCTCAAGGCGGTCATGACCGCG 
GCCCGTATTCAAAAGGAGCTGAACGTGGAAGTGCCGCTGCCGACCTTCTTCCAGATGCCCACG 
GTCGCTGGCCTGGCCCAGTTCGTGACGCAAGCCAAGCGCAGCGGCCGGGAGACGATTCGGCGC 
ACCGCGCCGCGCCCACATTACCCGCTCTCGGCTGCCCAGGGCCGCCATTACCTGCACTACCGC 
ATGGACCCGCGTTGTACCGCATACAACGATCCCTTCGCCAACCTGATCGAGGGTCCGCTGGAC 
GTGGATCGCGTGGAGCGCATCCTGCACACCCTCATCCTACGCCACGACTGCTTCCGCACCTCG 
TTCCACTTCCGCGAGGGCGAGCCGGTCCAGGTGATTCACGATCGGGTGGACTTCAACCTGGCG 
CGGATTACCTGCGCGCCCGAGGATTTGCCCGAACGGATGCGCGATTTCATCCGCTCCTTCGAT 
CTGGAGCGACCGCCCGCCATGCGCGCCGGCCTCTTCGTCACGGGGCCCGAGCGCCACGTGCTG 
CTAATCGATTTTCACCACATTATCACCGATGGCGTGTCGTTCGAGAACTTCGTCGGCGAGTTC 
GCGGCGCTCTACCGCGGCGAGATCCTGCCCGAGCTGGAACTCGAGTACAAGGATTTCGCGGTG 
TGGCAGCATGAGAACCGGGGCCGCCGCGCCAACAGCGACCAGGCCCGCTACTGGACCGAGCAG 
TTGGCCAATGCGCCCGGGCCGATCGAGCTAACCACCGATTTCCCCCGTCCCAGTCGACGCAGC 
TTCCGCGGCGACCGCGTGCGGACCGTGCTTGATGCGGAGCTCGTTGCTCGACTCAAAGAGCAC 
GCGGCGCGCCTCGGCATCACCCTCTATAGCCTGCTGCTGGGCGGATTCTCGTTATTGCAGCAC 
AAGCTCTCCGACTCGCACGACATCGTCATCGGTTCGCCCGTCGCGGGCCGCACCCGGAGCGAA 
CTCCAGGATCTGCTGGGCGCGTTCGTCAACACCCTGCCGATGCGCCACCGCATCGACCCGACC 
CATACCGCACGGGTCTTCTTGGAGCAGGTCCACCAGACAACCTTGGCGGCCCTCAGCTACCAG 
GAGCACCCTTTTGACGAAATGGTGGCGACGCTCGGGTTCGCCGCCGATCCGGCTCGCAACCCG 
ATCTTCGACACGATGTTCTTGCTGCAGAACATGGCCATGGGTGCAACCACCATTCCCGGTCTG 
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CGGCTCTCGCCTCACGACACTTTTCACCGCAAGGCATTGTGCGACCTGATGCTACAGGCGACC 
GAGTATGACTGCCACCTGGAGCTGGTGCTCGAGTTCGCCACCGACCTGTTCCGGCTGGAAACC 
GCGCAAGTCTTGCTCGACCGCTACCGCCAAGTCTTGGAGTGGCTGTTGGCGTACCCCCATGAA 
TCGATAGACGATTTGACGCTCGCCGGCCACTTTCGCGAAGTCGAAGTGACGATGTCGGACGAG 
GGCGACTTTGATTTCTCAGATTTCGAACCCCGCAACGTGAGAAACCTATGGCGCGCC 

(2) peptide sequence 

Seq ID No 58 ( >pEPOcos6_ORF9 .pep) 

MKL^AA;ANRLFDPESPERTEPAKSLL1AVTKVLPQEVPNVRTRAISVDLDRSFDAAAPAWAAS 

LLVECGAPVEETWTYHGAARWLRRFDRVAVNGU3PFHPDQPAPLLRERGVYLITGGLGGVAG 

Q1ARYIJUUVCRARLVLTARRPLPERDQWDRESAVLSWDDKTRQRIELVRELERLGAEVLVVAA 

DVADEAAMAQAIEASLARFDALDGLIHGAGIVRVA3GRTPIGSMTRAMCEEQLRPKMLGLDW 

DRLLRDRRIXIFRIAISSIAPILGGLGHVAYAAANLYMDAFATRAAAGNAPWIALNLAEWEYEG 

PATYDERVGRSLKQLELTNEEGIRVFQTVLALAARGPLQQIIISTGDLQARLDKWIHIKSLHR 

RPGPVQLSRRTAAPQGGFGSERAAFEAAFADAWCDFFGVEEVDPNKNFFDLGASSLDFIHLVS 

RFSKAIEQHVPLEALLEHSTLHDIJ^HLAGDANTDASDEARIRQRLQGAKSGDIAIIGMAGRF 

PLAPDLDTYWRNLVGGIDAVSFFSAEELRAAGVTAAEIHHTNYVPAKGRCADQDLFDAAFFEY 

TASDAELMDPQNRVLHEWWHALEDACFDFNGDHGQVGLFAGAS PNLWWQFVAS FS EAAKTQG 

MFTTTLLNDKDS IATQI SYKLGLKGPAVTLFTGCSTSLVAVDAACRS IWSGQSDMAVAGAVSL 

TLPDKAGY I YEKGMLF 5 ADGHCRAFDANATGMVFGDGAGAI VLKPLDAALRDGDP I HAVI KGC 

ATNNDGD RKAGYTS VSAQGQAEV I RS AQ I LAD VAPE S 1 3 YVEAHGTGTKLGDS I EI KALKQAF 

ASDKNGFCGIGSVKTNLGHLMAAAGMAGLIKTVLAMKHRQLPPSLHCDEVNPDLELERSPFYI 

NTRLRDWVAPGGPLRAGVSSFGIGGTNAHVILEEPPTRESGTRMRHWKLLMLSAASEAALDRQ 

ADNLADYLERHPEAHLSDVAYSLQTGRRVLAWRRTVLCEYREDAVTSLRERQAKRVQTSRVRW 

DHKDVVFMFPGQGAQYLNMGRDLYVMEPVFREVMDRCFELLAPLWSEHPRQILYPEGGVSTLL 

HRTDYTQPIVFCFEYALAHLLLSWGLXPAATIGYSFGEYVSACLAGVFSLEDAIRLVTERGRL 

^4AALPAGAMLSVPVPECELLRLLDGFHAQSAAHIALAVDNGASCIVAGEQAAISAFESMLRKK 

RLLTMRVAVSHAAHSQVMTGATDALRSILRKIPLSAPTIPFISCVTGTWITAQQATDREYWVN 

HMCGTVRFAAGLTELGQNREAVFLEVGPGRDLTLLAHRILADSAAVFELVKAPDGGDDDGFLL 

LDRLAKLWRLGISIDWAGFYADERRRKLSLPGYPFERRRFWIEGNPLEIAAGRPNVQGPLVKA 



WO 00/22139 



123 



PCT7US99/23535 



SDIGAWFYVPQWRRSVLAEPGTTAAGAAVTAEQARWTELRAGCASAGLGSGACGLNGGAPSE 
RPKESVAPAGSTS AAAQTGADCPT PTGEPAAVPKDGAEPRPTWLI FADAGGLAES FAKRVQAR 
GEKLYLVASGSRFERLAETRFRLDPGAKSDHRLLFKALDEADILPTHLLDFRSLDCGGPDADP 
MDQAGFFGLLHLVQAMAEAGYSHPIRLLIVSCGVYDVTGAEPLQPARATMIGPALCIPQQYPH 
LETSHVDLGVVHADELHAARQLDSLLAECLSATAERQI^ 

PWRQRGVYLITGGLGGIGRILAEHLARTTSARLVLIGRETLPDRDDWDAWLNRPQPVDATHER 
LLHKIRAIRDLEALGAEVLVLAADVANEAAMREAYDRAESHFGTIHGVIHGAGLMDAQSFSLI 
DALDHDLCARQFEAKIRGVCVLDRVLADRTLDFCLLMSSISTVLGGLGYFGYAAANAFLDAFA 
QARSRDAAFPWLSVAWSDWKYWTERKMDNEVGAVIDSLSMEPAEGFEAVTRVLAWGKAPHIAN 
SPGDLGRRRDQWKLASLKSAHSSEPEPARHGRPALSSEWVAPRNVVEEKLVAIFEQVFGTAA 
LGIEDNFFELRGDSLKAVMTAARIQKEUSTVEVPLPTFFQMPTVAGLAQFVTQAKRSGRETIRR 
TAPRPHYPLSAAQGRHYLHYRMDPRCTAYNDPFANLIEGPLDVDRVERILHTLILRHDCFRTS 
FHFREGEPVQVIHDRVDFNLARITCAPEDLPERMRDFIRSFDLERPPAMRAGLFVTGPERHVL 
LIDFHHIITDGVSFENFVGEFAALYRGEILPELELEYKDFAVWQHENRGRRANSDQARYWTEQ 
LANAPGPIELTTDFPRPSRRSFRGDRVRTVLDAELVARLKEHAARLGITLYSLLLGGFSLLQH 
KLSDSHDIVIGSPVAGRTRSELQDLLGAFVNTLPMRHRIDPTHTARVFLEQVHQTTLAALSYQ 
EHPFDEMVATLGFAADPARNPI FDTMFLLQNMAMGATTI PGLRLS PHDTFHRKALCDLMLQAT 
EYDCHLELVLEFATDLFRLETAQVLLDRYRQVLEWLLAYPHESIDDLTLAGHFREVEVTMSDE 
GDFDFSDFEPRNVRNLWRA 

pEPOcos6_ORF10 sequences: 

(1) nucleotide sequence 

Seq ID No 59 ( >pEPOcos6_ORF10 . seq) 

ATGGCGCGCCTGAGCCGCACAGATCTCCAACTCGCCATTCACCAGCGCACCGTGGAGCGCGAA 
TATTGGCGCGCTCTGTTCGAGCGCCATCCGCAACGGTCCAGTTTGCCGGGGGTGCTCACCGCC 
CCGATCGGCGACGAGTCGACCCGCGAGACCTTGTCATTCGTCCTCGACGAAGATCCCCTTCGG 
CTGAGTAATCGTTCGCCGCAACGCCTGCTCACGGTGTTGGCGGCTGGCCTCGCGGCTTTCCTC 
CACCGCTGCGACGGCGCTGAGCGCTTCACCCTGGGGTTGGCCCTACCGCGCCAAGCCGATGAC 
CATCACCCGATCCTCAACAGCTTGATCGCGCTGGGGGTCGCGGTCGACTCGAGTACGACCTTC 
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CGCGATCTGCTCTATGCGCTTCGATCCGAATACCACGAGGCGATGCGCCACGCCAACTTTCCG 
CTGGCGACCTGGTGGCGCGGCCTACCCGGCGGAACGGCGCCGTTCGACGTCGCCCTCAGCCTG 
GACCCCTTCACAGACGGCGATTCGCTGGAAGACCACGCGATCGGCGCGTTGTTCCGGTTCGCA 
TTGGAGGGTGAGCGCCTCACCTGCCGATTGCGATTCGACCCTGCGCGCTATGACCGTCCCGCG 
ATCGAAAACCTCGCCGATCGTTTCGCCCGCTTCCTCACGCGCCTGTGCCGGGACGCCTCCACC 
GTCATCCAGGCGCTGGACCTTTCGCTGCCAAGCGATGAATCGGTGTGGCGCGTCACTGAAGGC 
GTGCGGCGCGGCTATTCGCAAGACCTGACGCTAGACCGCGCGTTCCGCCGCCAGGCCGCGCAA 
ACGCCCGATCAGCCGGCGATCACGTTGAACGGGGACGTCCAGAGCTACGCCGAGGTCGACCGC 
CGCAGCGACGCGCTGGCCCGCCACCTCCGTCGCCACGGCGTCGGTCCGGAAACGATTGTGGCC 
GTCAACGCCCGGCGCGGGCCTAATCAGCTGACGGCCCTGCTCGCGGTCCATAAGGCCGGCGGC 
GCCTACCTGCCGATCGATGCCGAGGAGCCGGCTGCCCGCCAGCAATTCAAGGTGCGCGACAGC 
GGGGCGCGGTTGGCACTGGAGCCGTCGCCGGACCAGGCGCTGACCGTCACCGACCTGCCGCGG 
CTCTTCCTGGACGATGCCTCGCTCTTCGCTGACGGCGGGCTCGATGTGCCGCGCGGCGCCGAC 
TCGCTCAATCCGGCCTATGTGATGTACACGTCCGGCTCGACCGGACAGCCCAAGGGTGTGGTG 
GTTCCCCACCGCGGCGTGGTCAATCGTTTGAATTGGGGGCAGTCCCGTTTCCCGCTGGACGAA 
CGCGACCGAATCCTCCAAAAGACGCCGCTGCTGTTCGACGTGTCGGTCTACGAGCTGTTCTGG 
GGCGCATGGAGCGGGGCCACCCTGGACATCCTCGAGCCCGGCGCCGAGCGCGACCCCGACGCA 
GTGGCCAGGGCCCTGGCCGAGCGCGCCATTACCGTATGCCATTTCGTGCCTTCGATGCTGCTC 
GTCTACTTGGAAGTCATGCGGCGGCACCATGCGCCGCCCGTGCCGGACCGGCTCCGTTACGTC 
TTCGTCAGTGGCGAGGCCCTCGAACCGGACCACCTCGCCGGGCTCCAGCAGATTGGTCGGCGC 
CTCGGCCGCACGATTCCCCTCGTTAATCTGTATGGACCAACCGAGGCCTCGATCGAAGTCTCC 
TGCTTCGCCTGTCCCGCCGACCATGTGCCGCGCCGGATCCCCATCGGGCAGCCGATCGACAAC 
GTCGCACTGCACGTTCTCGACCGGCGCGGCCGTCGCCAGCCGCCCTATCTTCCTGGCGAGCTG 
TTCCTGGCCGGCGACTGCCTGGCGCGCGGCTACCTCAACCGTCCCGACCTGACCGCGCTCCAC 
TTCGTGCCCAATCCCTTCGGCAACGGCGAGCGCATGTACCACAGCGGCGACTTGGCGCTCGTG 
CGCGGCGACGGCCAAGTGGCGTTTCTCGGCCGCCGTGACCACCAAATCAAAATCCGTGGTCAA 
CGGGTCGAACTGGGCGAAATCGAGAGTCATTTGCGCGGGCTCGAAGGCATCGCCGCCGCCGTC 
GTCCAGGCCGAGTCGCAGCACCATGAAACCCTGCTGCACGCCTACGTCGTCACCAACGACGCG 
GGCCTCAATGCGGCCCGGCTGCGCGCCGCCCTCGCTCAACATCTGCCCGAGTACATGATTCCC 
CAGCGCTTCTCGCGGCTGGCCGAGTTGCCGCTGCTGGCGGCAGGCAAGATCGACCGCGCCGCC 
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CTCGCGCAACGTGCAACGCCGCTCGCCAGCGGCGCGCCCTTCGTGGAACCCAGCGGGCCCACC 
CAGCAGCGTATCGCAGAACTGTGGCGCCAGGTCTTAGCGGTCGCCGAAGTCCGCGCCGACGAT 
CCCTTCTTCAGCATCGGCGGCAACTCGCTCAATGTGCTCAAGCTCAGCGCCGCGCTGAGCGAC 
GCCTTCGCGCGTGACATTCCCATGCCGGCCCTGTTCCAATACGACACCATCGCCGCCCAGGCC 
TCCTGGCTCGACGGGCAGGTTGACGAACGGGCCCAATCCGCCGCGCTCGACCGGCAGGCCGCC 
GAGGCGGCGCTGACCCTTCAAGAGACCGTGGCCATTTTTGAGGGATTCGATGACGAACCA 

(2) peptide sequence 

Seq ID No 60 (>pEPOcos6__ORF10 .pep) 

MARLSRTDLQLAIHQRTVEREYWRALFERHPQRSSLPGVLTAPIGDESTRETLSFVLDEDPLR 
LSNRSPQRLLTVIaAAGLAAFLHRCDGAERFTLGLALPRQADDHHPILNSLIALGVAVDSSTTF 
RDLLYALRSEYHEAMRHANFPLATWWRGLPGGTAPFDVALSLDPFTDGDSLEDHAIGALFRFA 
LEGERLTCRLRFDPARYDRPAIENLADRFARFLTRLCRDASTVIQALDLSLPSDESVWRVTEG 
VRRGYSQDLTLDRAFRRQAAQTPDQPAITLNGDVQSYAEVDRRSDALARHLRRHGVGPETIVA 
VNARRGPNQLTALLAVHKAGGAYLPIDAEEPAARQQFKVRDSGARLALEPSPDQALTVTDLPR 
LFLDDASLFADGGI^VPRGADSLNPAYVMYTSGSTGQPKGAAA/PHRGVVNRLNWGQSRFPLDE 
RDRILQKTPLLFDVSVYELFWGAWSGATLDILEPGAERDPDAVARALAERAITVCHFVPSMLL 
VYLEVMRRHHAPPVPDRLRYVFVSGEALEPDHLAGLQQIGRRLGRTIPLVNLYGPTEASIEVS 

CFAC PADHVPRR I P I GQP I DNVALHVLDRRGRRQP P YLPGELFLAGDCLARG YLNRPDLTALH 
FVPNPFGNGERMYHSGDLALVRGDGQVAFLGRRDHQIKIRGQRVELGEIESHLRGLEGIAAAV 

VQAESQHHETLLHAYWTNDAGLNAARLRAALAQHLPEYMIPQRFSRLAELPLLAAGKIDRAA 

LAQRATPLASGAPFVEPSGPTQQRIAELWRQVLAVAEVGAEDPFFSIGGNSLNVLKLSAALSD 

AFARDI PMPALFQYDTI AAQASWLDGQVDERAQSAALDRQAAEAALTLQETVAI FEGFDDEP 
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pEPOcos6_ORFll sequences: 

(1) nucleotide sequence 

Seq ID No 61 ( >pEPOcos6_ORFll . seq) 

ATGACGAACCATGACCATCACGAGGAGAGCAGCGGCCTGGAGATCGCCGTCATCAGCATGGCC 

TGCCGATTCCCGGGTGCTATGGCCTGCCGATTCCCGGGTGCTGCCGATTGCGACGCATTCTGG 

GAAAACCTGATCAACGGGACCTCCTCGATCACCCATTTCAGCGACGACGAGCTGATCGCGGCC 

GGCGTTGACGCGCGCGACCTGACGCCGCAGTACGTGCGCGCGGCCGGCCAGATCGATGACGCC 

GAACGGTTCGACGCGGCCTTCTTTGGGTACTCCCAGCGTGAGGCCGAGCTGATGGACCCCCAG 

TTCCGCCTGCTCCATGAATGCGCCTGGTCCTGTCTGGAACAGGCCGGCATCGATCCGCGCGTC 

GAAGCCGCGCCGATCGGGCTGTAT3CCGGCGCAGCCGACAACACCTACTGGAACGCGCTCTCG 

TCGCTCGACCGGGGCTCGGCCGAATCGGAGCAATTCGCCGCCGAACAACTTTGCAACCGCGAT 

TTTCTGTGCACGCTGGTCGCCGCCGCGCTCAACCTGAAAGGCCCCGCGGTGGTGGTTCAAAGC 

GCCTGTTCGACCTCGCTGTTGGCGGTCCACTCGGCCTGTCGTGCGCTCCTGACCGGCGAATGC 

CGAGTGGCCTTGGCCGGTGGGGTGGCGCTGCGCTTCCCACGCCCGAGdGGTTATCGCTACGAA 

CCTGGCATGATCTTCTCGCCCGACGGGGTGTGCCGGCCGTTCGACGCGGGCGCTAACGGGACG 

GTGCCCGGCGAAGGCGCGGGGCTGGTAGCGTTGAAGACGCTGAAACGTGCCCTCCAGGACGGC 

GACACGATCCACGCCGTGATTCGCGCGACCGCGGCAAACAACGATGGTGCCCGCAAGACCGGG 

TTCACCGCGCCCAGCGCCCACGGCCAAGCCGAAGTCATTCGCACGGCGCTGCGCCTGGCCCGG 

GTGCCGGCCGAATCGATCGACTACGTCGAGGCCCACGGAACCGGCACGCCGCTAGGCGACCCG 

ATCGAGGTAGCCGGCTTGGTGGAGGCCTTCGCCAGCGAGAAGCGCGGCTATTGCCGGCTGGGC 

TCGGTCAAATCCAACCTTGGTCATCTGGACACTGCTGCCGGCATCGCCGGCCTGATCAAGACC 

GTGCTGGCGCTCGAGCACGCGCACATCCCCAAGTCCTGCCACGTCGCCACGCCCAACCCCGCG 

GCGCGCCTACACAAGACGCCTTTCCGCATTGCCGCCGACGGGATGGCCTGGCCGCGGCGTATG 

GCGACGCCGCGGCGGGCGGCGGTGAGTTCGTTCGGCATCGGCGGCACCAACGTCCACGCGATT 

TTGGAGGAGGCGCCGCCCCGCGCGCCCGAGCTGGCGGACGGGCGCAGTCAGGTGTTCGTCTTC 

TCCGCCAAGGACGAGGCGGCGCTGGACCGTGCCCTTGCCAACTATGGTGCGGCCTTGGAGAAG 

CGCGGCGACCTCGCGGCGGGCGCGGTGGCCTGGACGCTCCAAAACGGCCGGGCCGCATTCX3AA 

TGGCGAGCCAGCGCGGTGGCATCCGACCTCGACGAATTGGCGGGCGCATTGCGCGGCGAGCGG 

CCCGGCGCCGTCAAGAAAAACCGAATGGCGCGCGAGGATAAGCCGGTGGCGTTCTTATGTTCG 
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GGGCAGGGGAGCCAGTACCGTGGCATGGGCCACGACCTGTACCGCGAAGAGCCGCGTTTCCGG 

CACCACCTCGACGCCTGCCTCGCCATCCTCGCCGAACACAAGCCCGAGATCGACTGGCTGGCG 

TTGCTGGGCTACCGCGACGAGGACGAGCCAACCGACCAGATCGGGACGTCCTCGCAGGGCCCG 

AGCCGGTCAGCCGCATCGAACCCAGCGGAGCTCCTCGACAGCACCGAATTCGCCCAACCTTTG 

CTTTTCTCCATGTCCTACGCGCTCGGTCGGCTGTGGCTCGACTGGGGCGTGCGACCCACGGCG 

ATGATCGGGCACAGCCTGGGCGAGTACAGTGCTGCATGTATTGCAGATTTCTATGCACTCGAT 

CAGGTGCTGCCCTTCATTCTGACCCGCGGTCGAGTCATGGCGCAATTGCGGCGCGGCTCGATG 

TTGGCCGTCAGCGGTGACAGCGTTCTGATGCGCGAGCTGATCGCCGATGCGCTCGATTTGGCG 

GCGATCAACGGCGCTGACCAATTTGTCTGGAGCGGGCCGAGCGAGGCTGTCCAAGCCGCGGGG 

GTCCGACTGCGCGGCGCCGGCCTGCGTGCCACCGAGCTGAACACCTCACACGCGTTCCATTCA 

GCCATGATGGATCCCATTCTGGAGGAGCTAACGGTTGCCGGTTCGCGACTTCAGGTCGGTGTC 

GGGACGATTCCGGTCGTTTCATGC3TTACCGGAACCTGGTTGACGGCGAAGCAGCTGGCCGAT 

CCGCGCTACCACGCGCGTCACGCGCGCGAACCGGTGCGGTTCGCGGCGGGCCTAGCGACGCTG 

ACAGGGGAGGAGCCGCCGCTGATGCTCGAAGTGGGGCCGGGCTCGACCCTGGCGGCTTTGGCC 

CGCGAGCATTCGAATGCCCGCCTCCCGGTCGTCACCAGCCTGCGCCACGCTCGCCAGGCGACG 

CCCGATCGCCAATACCTGCTCGAAACGCTCGGCTGCCTTTGGCGACACGGGGTTTCCGTCGAT 

TGGGGGGCCCATGCCGGACGTTCGCGACGCTTGGTTTCGCTGCCCGGCTATCCCTTTTCCGGC 

GCGGTGCGCCGCTTAGCCGGCGACCCCCTCCGCCTGCTGGCCGGAGCCCGCGCCGTCGCCGCC 

CCGTCGGGAACGCGCCAACTCAGCGCCGACGCGCGCGACCTCCCGAACACTCCGGAGCCGACA 

TCCGGCGCCGTGTCGGCGATCAAAGCGCCAATCGCCGCCGCCGATCCCGGCCTCTATCGCCTC 

TCCTGGCGCCAGGCCGGAACGGCGCCGCTCGGTCCGCCCGATCTCGGTCCGCCCCGCGACTGG 

ATCGTCTTCGCCTCTGATTCTCACCTGCTCCAGGCGCTCAGGGCCAATCTCGGGACGCGCGCT 

CAGCGGGTGACGCTGGTGACGCCGGGCCAGGAGTACGCAGCCGAGCCGTCCGGGTTTCGGCTG 

CGGCCGGACCAGATCGACGATTACCGCGCCCTGTGGGCGGACTTGGCGCAAACCGGTATTGTG 

CCACGATACATCGCGTTCCTCGCCCCGTTCATGTACCGGGCGCGCATGGCGGGCGATGCCTCG 

ACCCTGGACGAAGTGCGCGAGGGCGGCTTCCTGCCCCTGACCCGCTTGATCCAGACTCGCCCG 

CCAGGCGGACCGAGCGGACTTCTAAGCCTCACGATCGTCACCCCGGCCGCCQTGGCGCTGGGC 

GACGAAGCGACGCGCCCGGAATGGGCAATCCTGCACGGGATGGTCGCCGGCTTAAGCCGCGAT 

TATCCCGAATGGCGCTTCGTCTCGATCGACGGCGGCGACCCATCCCCGCATCGGTGCGAAGGT 

CTGGCCCGCTTGATCGCGCTTCATGCGGTCGACGAGGCTGGCCCGACCCGCTTGGCGCTGCGC 
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GGCCTTCACGCTTGGGTTCCACAGTGCGAGCACGTTCAGCCGGCCACCATCCCTGGGGCGGGT 
ATGTGGCGCGAGGGTGGTGTGTACATGATAACGGGCGGATTCGGCGGGATCGGTCTGGCGCTG 
GCCCGCGCCCTGGCTCGAGAAGCTCGCGCCAAGCTGATCCTGGTCGGCCGAAACCTGCCCACC 
GCGCCGATCGATCTCGAGGCTTGGGACGCGCCGCCGTTGATTCTCACCGCCGACGTCGCCGAC 
5 GAAGAGGCCATGCGCCGCGTCTTCGATGCCGCGCACGCCCGGTTCGGCGCCATCGACGGCATT 
CTTCACGCGGCCGGTGTCCCCGGTGGCAGCCTGTTCGCCAACCAATCGGACGCGGCCTTCGAA 
GACGTGCTGCACGCCAAGGTTCGCGGTACCCTCGTGCTGCAAGGCCTGAGGGCAATCGATGCG 
CCGCTGTTGCTGATGTCCTCGCTGGACGCCTGGCTTCCCGGTCCCGGTCAGACCGCCTATGCC 
GCCGCCAACGCCTTCCTCGACGCCTTCGCCAGTCTGCGCCGGCGAGAGGGAGAGCCGGTGTAC 

IC AGCGTTGGCTGGGACAGTTGGTGCGAGGTGGGCATGGCTGCTCGGGTCGCTGCCCGATCGGCC 
GACGAACGCGGCCGCCTGGCGCGCGAGGGGATCAGCCCTCGCCAGGGTTGGCAGGCTTTGAGC 
CGGGCGCTCGCCCTCGACCCCCCCCACCTGATGATCTCGCGCACCGACCTGACCTCGCGCTGG 
CACAGTCGATCCAGCCCTACGCCGGTCGCCTCGAGCGAACCCGAGGTGGCGCTGCCGCGCTGG 
ACCGCATCCGCCTGCCAAGCCGTCATCGAGCGTGTTTGGTGCGAGCACTTCGCCACCGCCGCC 

15 GTGCCTCCCGATGGCAACTTTTTCGAGCTCGGCGCCAGTTCCTTCGACATCGTCCAGCTCAGC 
GCTCGACTTCAACAACAGTTCGGCCGAGATGTCAGCCACACCGTGCTCTACAGTCATCCCACC 
GTCGCCTTGCTGGCCGGCTACTTCGCCAATGACCCGACGCCGTCCGGTGCTGCTGCCGACGAA 
CGCGACGAAGCGGTGCGTCGCGGCCGCGACCTCTTGAAGAGCCGCCGGCGAGGAGTA 

2 0 (2) peptide sequence 

Seq ID No 62 (>pEPOcos6_ORFll .pep) 

MTNHDHHEESSGLEIAVISMACRFPGAADCDAFWENLINGTSSITHFSDDELIAAGVDARDLT 
PQYVRAAGQIDDAERFDAAFFGYSQREAELMDPQFRLLHECAWSCLEQAGIDPRVEAAPIGLY 
AGAADNTYWNALSSI^RGSAESEOFAAEQLCNRDFLCTLVAAAIiNLKGPAVVVQSACSTSLLA 

25 VHSACRALLTGECRVALAGGVALRFPRPSGYRYEPGMIFSPDGVCRPFDAGANGTVPGEGAGL 
VALKTLKRALQDGDTI HAVIRATAANNDGARKTGFTAPSAHGQAEVIRTALRLARVPAES IDY 
VEAHGTGTPLGDP I EVAGLVEAFASEKRGYCRLGS VKSNLGHLDTAAGIAGLI KTVLALEHAH 
IPKSCHVATPNPAARLHKTPFRIAADGMAWPRRMATPRRAAVSSFGIGGTNVHAILEEAPPRA 
PELADGRSQVFVFSAKDEAALDRALANYGAALEKRGDIiAAGAVAW 

30 DLDELAGALRGERPGAYKKNRMAREDKPVAFLCSGQGSQYRGMGHDLYREEPRFRHHLDACLA 
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ILAEHKPEIDWLALLGYRDEDEPTDQIGTSSQGPSRSAASNPAELLDSTEFAQPLLFSMSYAL 
GRLWLDWGVRPTAMIGKSLGEYSAACIADFYALDQVLPFILTRGRVMAQLRRGSMLAVSGDSV 
LMRELIADAIJDIJ\AINGADQFWSGPSEAVQAAGVRL^ 

ELTVAGSRLQVGVGT I PWS CVTGTWLTAXQLADPRYHARHARE P VRFAAGLATLTGEEPF LM 
LEVGPGSTLAALAREHSNARLPWTSLRHARQATPDRQYLLETLGCLWRHGVSVDWGAHAGRS 
RRLVSLPGYPFSGAVRRLAGDPLRLLAGARAVAAPSGTRQLSADARDLPNTPEPTSGAVSAIK 
APIAAADPGLYRLSWRQAGTAPLGPPDLGPPRDWIVFASDSHLLQALRANLGTRAQRVTLVTP 
GQEYAAEPSGFRLRPDQIDDYRALWADLAQTGIVPRYIAFLAPFMYRARMAGDASTLDEVREG 
GFLPLTRLIQTRPPGGPSGLLSLTIVTPAALALGDEATRPEWAILKGMVAGLSRDYPEWRFVS 
IDGGDPSPHRCEGLARLIALHAVDEAGPTRLALRGL?iAWVPQCEHVQPATIPGAGMWREGGVY 
M I TGGFGG I GLALARALAREARA KL I LVGRNLPTAP I DLEAWDAF PL I LTADVADEEAMRR VF 
DAAHARFGA I DG I LHAAGVPGGS LFANQSDAAFEDVLHAKVRGTLVLQGLRAI DAPLLLMSSL 
DAWLPGPGQTAYAAANAFLDAFASLRRREGEPVYSVGWDSWCEVGMAARVAARSADERGRLAR 
EG I S PRQGWQALSRALALDP PHLM I SRTDLTSRWHSRS 3 PTPVAS SE PEVALPRWTASACQAV 
IERVWCEHFATAAVPPDGNFFELGASSFDIVQLSARLQQQFGRDVSHTVLYSHPTVALLAGYF 
ANDPTPSGAAADERDEAVRRGRDLLKSRRRGV 

pEPOcos6 ORF12 sequences: 

(1) nucleotide sequence 

Seq ID No 63 { >pEPOcos6_ORFl'2 . seq) 

ATGACCGTGGAGCACGAAACCGGATTCGAAATCGCCGTCATCGGGCTGGCTTGCCGCGTTCCC 
GGCGCTGCCGACGTGGCCGCCTTCTGGCGCAACCTGGTCGAGGCCAAGGAGAGCGTGCGCTTC 
TTCGAGGACCACGAGCTGCGGGCCGCCGGCGTGCCCGAGGAGATCTTGCGCCTGCCCAACTAC 
GTGAAGGCCAAGCCACTGCTCGCTGATGGCGAAGCTTTCGACGCGGACTTCTTCGGGTTCCAT 
CCGCGCGAGGCCGCCTACCTGGACCCGCAAGTTCGGCTCCTGCACGAATGTTGTTGGACCGCG 
CTGGAGGATGCCGGCTACGATCCCGCGCAGTACGCCTACCCGATCGGGTTGTTCGCGGGCGTC 
TCCAGCAATCTCTCGTTCCTGTTCGACCGCATCGATCCGCGCGACTCCCCCCTGCAGAAGCGC 
TATGTGGCCGAGCTGAACGCGGCCTCCTTCGCCACCCAGATCGCCTACCGGCTCGATCTGAAG 
GGGCCGGCCATTTCGATTCAAACCGCCTGTTCGACGTCACTGGTGGCGATTCACCTGGCGGCG 
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CAAAGCCTGATCGGCGGCGAGTGCCACATGGCCTTGGCCGGCGGAGCGACCTTGGAGGTCCCC 

AAAAAGCCCGGCTATCTCTACCGCGAAGGCTACATCAACTCGCCGGACGGCCACTGCCGGGCC 

TTCGACGCCGACGCGGCCGGCACCATCTTCGGCGACGGCGTCGGCATCGTCCTGCTCAAACGC 

TACCGCGACGCCCTACGCGACGGCGATCACGTGTACGCAGTGATCAAAGGCTCGGCGATCAAC 

AGTGACGGCCATCGCAAGGTGTCC7ACACGGCGCCGGGCAAGAGCGGTCAAGTGGCGGTGATC 

CGCGCTGCGCTGGCGGCGGCCCAG3TAGAGCCGCAAACCATTCGCTTCGTCGAGGCCCACGGG 

ACCGGCACACTCGCCGGCGATCCGATCGAGGTAGAGGCGTTGACGGAGGTCTTTGCCGAAGCG 

GGTCGCGGTACCTGCGCCCTGGGTTCGGTGAAGACCAACATCGGCCACTTGGATGTGGCGGCG 

GGCGTGGCCGGTTTCATCAAGGCGGTCTTGGCGCTCGAGCGGCGCGTCCTCCCGCCCAGCCTT 

CACTTCGTCCGGCCCAACCCGGCCATCGATTTCAACGGGCCCTTCTACGTTTGTCGCCAAATC 

GAGCGGTTGACGGAGAACGGGCGG7TGCGGGCCGGGGTGAGTTCCTTTGGCATTGGCGGCACC 

AATGCCCACGTGATTCTGGAGGAASCGCCGGCGCCGGAGGCGAGACTGCCGGCCGGGAGCCCG 

CCAGGCGCGAGTCCGTTCCTGTTCCCGCTATCGGCCAAGACGCCGGATGCGCTGGCAGGCCGT 

TGCCACGACCTTGCCGACCACCTGCGGGCGCACCCCGAGCTCCTCCTGGCCGATGTGGCCCTC 

ACTCTGCAGATGGGGCGGGCGTCGTTCGCCTACCGCCATGTGGTCCAGGCTGCGACGGCGGAG 

GAGCTGATTCGCGGTCTGGGAGCGTTCCGACAGGAGTCCATCCGCAAGAGGCGGAATCGAGTA 

CAATGGGTGTTGGCAGGCGAGGCGATGTCGCTTGACGCCGGTTTGCGGCTGTACGCCGATTGG 

CCGGTCTATCGGGAGCGGGTCGACGTCTGTCTGGCGATCGTCGCCAAGCTGCGCCAAATCGAC 

GGCCGGTCATTCCTACATGAGTGGATCGAGCGACCGCGCGAGGTTCCTGCCGAATGGTCGACG 

GCGCTGGCGTTCATGTTCCACTGCGCGCTGGCGCAAGCCCTGAGCCAGGCCGGCCTGCACCCG 

CAGCGCATGTGGAGCCGTGGGCTGGGCGGACAGGTCGGCGTGGTTTTGGCCGAATCCCTGTCG 

TTGGAACAAGCGCTGGCGCTGGTGTTGTGCCAGACACCGGTTCCCGGCGATGCCACACCTCAG 

CGCGAACGCTTGGTTCGGACACTG3AAGGCTGCCGGTTTCGTCCACCACGATTTTTGATTTCG 

GCAGACAGCTCGGGTCGACCCCTGGACCTCGCCGAATTCGCTCATGTCGATTTTTGGTGCGGT 

GGCCAAAGCGCCTCGCCCAATGAGGCGGAGCTGCGCTCATGGAGCGACGCCGCGCCCGAGCTG 

GTGACCTTGGCGATCGGCCCATCCTTTCTCGAGGCCGCCTCCGGGACGGTGGGTCTGGCGATC 

GACCCCAAGCGACCGATGACCTGTGTTCAGCGCACGGTGGCCGCGTTGTGGGAATGGGGATGT 

GACGTGCGCTGGGCTGCGTTCACCTCGTCGACCGGGCGTCGGGTTCCCCTGCCTACCTATCCC 

TTCGTGCGGGTAATTCCCACGATCGGCGACCCCCTTCGCGGAGCAGGCGCGGAGGATGACTTG 

ATTGCGGCGAGCGCTTCCGCGTCG3CCGGATCGCCGCCCGAGCCGTCGGCAAACTCGGCAGCG 
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GAACGCCCACGCGCCCAGTCAAGCATCGCCTCGGCAACCACACCGGCTCCGTCTCATACGTCG 

GCCAGCGTGGCCGTGGCCACCATTCTCGAAACCGTCCGTGCCTATTTCGGGTTCGCCGCCGTG 

CGTTCCACCGACGCCTTCTTCGAATTGGGCGCGTCCTCGCTGGATTTGGTCAACCTGGGCCAG 

CTCCTTTCCGATCGTCTCGGCCGCGAGGTTCCGACCCTGCTCCTCTACGACCACCCAACACCG 

GACCAGTTGGCGCTGGCCCTGACATCCGCGGCGCTCAGCGCAGAGGCGCCGCCCTTAAGGGGC 

GGTCATCGCGCATCGACTTCCGGCACAGCCGCGAGCTCGGCCGCCTCCACCGCACCGACGTTC 

CCGGGGGACGCTCACTCGCAGCCCAGCTTCGTTCGCGAGCAGGACATCGCCATCATCGGGATG 

GCCTTCCGGGGACCGGGCGCCGACGACCTGGACGCGTTCTGGAACAACCTGGTCGAAGGGGTC 

GAGTCGATCACCTTCTTCAGCGAGGACGAGCTGCTGGCGGCGGGCGTCCCCCGCGAACATCTG 

GCCTCGACGCGCTACGTGCGGGCCAAGGGGGAACTGACTGGGATGATGGATTTCGAACCGGAA 

TTTTTCGGTTATTCGGCGCGCGAGGCGGCGGTCATGGACCCGCAGTTCCGCGTGTTCCACGAA 

TGCTCCTGGCACGCACTGGAGCACGGCGGCTACGATCCGACCCGATGCGCGGCATCGATTGGC 

GTCTACGCCGGCGTGACCAACCACCTGCCTTGGCTGATGCGAACTTTGCCGCACGTGACCGAG 

GAGGAGCAATTCGGCGCGCTGCTCCTCACCGACCGCGAGTTTTTCGCACCGCTGCTCTCCTAC 

AAGGTCGGCCTGCGCGGACCCGCTATTTCGCTGCAAACCGCCTGTTCGACGTCGTTGGTGGCG 

ATCGGCACGGCCTGTCGCGAATTGCGCGCGGGTGCCTGTCAGATGGCCCTAGCGGGCGGCGTG 

ACGGCCAGCATCGAGCGCTGCGGCTACTTCCACCAAGAAGGCTACATCCTCTCGCCTGACGGC 

CACACGCGCAGCTTCGACGCGGCGGCCGCCGGCACGGTCTTCGGCGACGGAGTCGGCATGGTG 

CTGCTGAAGCCGCTGGCCCAAGCCTTGGCCGACGGCGACACGATCCACQCGGTGATCAAGGGA 

ATCGGCATCAACAACGACGGCGCGCGCAAGGTCGGCTTCACCGCACCTAGCCGGGCCGGTCAG 

ACCGAGGCGATTCGGGCCGCGCTGCGCGACGCCGGGGTGGCGTCGAACCGCGTCAGCTACGTG 

GAGGCGCATGGAACCGCGACCAGAATGGGCGACCCGATCGAGGTCGAGGCCTTGACCCAAGCC 
TTTCGCGCCG ^ GCCGACGGTCCGCTTCCGCCCGGCXCCTGCCTACTCGGCTCGGXGAAGTCC 

AAC3TGGGCCACCTGAACGCCGCGGCCGGCGTGGCTGGTCTGGTAAAAACCGTGCTGGCGCTC 
CAACACCGCCGCCTGCCGACCAGCCTGTTCTACCAGTCGCCCAATCCACACATCGACTTTGCG 
GCGAGTCCGTTCCGCGTGAACGGCCAGACTTCGGATTGGGTCGCGCCAGAGGGGACGCGGTTG 
CTGGCGGGAGTGAGTTCGTTCGGTATCGGGGGAACCAACGCCCACCTGATCGTCGAGGAGGCG 
CCGAAAGCGCTACCGACGACAGCGGCACCTCTGTCGACGGAGCCGAATGACCTCGACGCGGGC 

GAC3CCGACGGGCTAGTGCTGCCGATCTCGGCCCGCACGCCGACCGCCCTGGCGCACATCGCG 
ACCAACCTCGCCAATCACCTGGAACGACATCCGACCATCGCCCTGGCCGACGTCGCCCTGACC 



WO 00/22139 



PCT/US99/23535 



132 



CTTCAGCTGGGCCGTCGCCAATGGCCCCATCGCCACAGCCTGATCTGCCGGAATCGAACGGAG 

GCGATCAAGCTGCTGCGCGCCGTCGTCCACTCCGCGGAGGTGCCGCCAGCTCAGGCGCCGGTC 

TCGGATGCGCCGCGCTGTGTTTTTCTTTTTCCCGGCCAGGGCGCCCAATACCCGAGCATGGCC 

CGCGACCTGGTTCGAAACTGTCCCGACTTCGCCCTGCACCTGGACCCCTGCCTCGACCAGTTG 

GCCGAACTGCTTCCCGAAGATCCGCGTTGCATCCTGTTCGGCGATGGCCCCGCCGATCGGCTC 

GACCAGACGGCCTACACTCAGCCGCTGCTCTTCTCCGTGTCCTACGCCTTGGCGCGCTGGTTG 

GGCGATTTCGGCATTCGCCCCGATGCGATGATCGGCCACAGCCTGGGCGAATACGTGGCGGCC 

TGCTTGGCCGGGCTTTTCTCGCTGAGCGATGCCCTGCTGCTGGTGAGTGAACGCGGCCGCCTG 

ATGGGCTCGGCCGCGCGCGGAGCGATGCTGGCCGTCCCCTTGCCCGAATGGGAACTGGAGGAA 

CGCCTGGAGCTTCTGGCCGACGACCGAATCAGCATCGCGGCGGTCAACACCGCCGAGAGCTGC 

GTCATCGCGGGACCCAGCGAGGCGATCGAGCGCTGCGCCCAGCGCTGGGCCGCGCAAGGCCTG 

ACCTGTACGCCGCTGCGCACGTCCCACGCCTTCCACTCCGCGATGATGGAGCCGATTGTCGAA 

CCCTTCGGCCATGTCTTGGCACGGGTCACCTTCGCGCCGCCGCGCGCGCGCTGGATCTCGAAC 

CTCGACGGCAAGCCGATCGATTCCGCGGCGGTGATGCAGCCCGACTATTGGGTGCGCCACCTG 

CGCCAACCGGTCCGCTTTCACGAGGGACTCAGTCACCTGTTGGCCGAGGACACCCATGCTTGG 

GTCGAAGTGGGTCCCGGCCGAACCCTGTCCTCCTTCGTCCGCCGCCACCCGGCCTACCGTCAC 

CAGCCAATCGTCAACCCCATGCGCCATGCAGTCGAGTCGACGGGCGACGTGCGCCGGTGGCGC 

CAA3CGCTGGGCGAACTATGGCGGGCCGGCATGCCGGTCGCCTGGGAGCGGCAGCGGCGCGGC 

CGGCATGCCGGACGACGTGTGCCGCTGCCGGGCTACCCCTTCGAGCGGCGGCCCTTCGCGGCC 

CGAAGACCGGTGGAGCTGGCGCAGCCCGCGCCCAAGGCGGAGCTGGTGAAAAACCCCGATCCC 

GCGCGGTGGCTGTACCGCCGCGTCTGGCGCCCTGCCCAGGCTGCGGCCGGCGGACTGGCGGTG 

CAGGCGACCGTTCTGGTCTTCGGCGACGGGTCCGAGCTGTGCCGCGCGGCGGTCGCTCAGGTG 

CAGCGCCAGGGGCTGAAGTGCGTCTCGATCACCGCGGGCCGCCAATTCGCGCGGGAGAGCGAC 

ATGCGCTTCACGCTTGACCCCGCTGATCCGCGCCAGCTCGACCAGCTCTTCGCGGCCCTCGAT 

GGCTCAGGCTCGCGGCCGCGGTACGTCCTGCACCTGCTGACCCTGAACCCGCCCCCGGATGCC 

TCGGCGATCATCGCTCACAGCTACTACAGCCCGATGGCCTTGGCTCATGCCTTGGGCGCCCAC 

GAGATCGCGCCTGTCTCGATCACCGTCGTCACCGCCGGGGTCGTCGCCGTCGCGGACGAAGCG 

ATTCGCGAGCCGCTGCAGGCGCTGATCGTGGGCCCGTGCCTGGTCATCCCGCAGGAGTTTCCC 

GGC-CTCAGCGTTCGGCTGCTGGACGTCAACGTCGACGATCCGGCACCGCGTCTGGCGGAGCGG 

CTCGTGGCCGAGCTCTCGGGCACGGATCACATGGTGGCGCTGCGCGGCGGCGAGCGCCTAGTG 
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GCCGATGTCGATCAAGTCGATGGCCTCGGTGTGGGGATCGCCAAGGTGCCCTTGCX3CCGCGAG 
GGCCACTACCTGATTCTCGGCGGCCTGGGCGATATCGGCTACCACTGTGCCCGCTATCTGGCC 
CAAACCTACCGCGCCAAGCTGACGCTGACCGCGCGTTCGTCACTCCCGCCGCGCGCGTCGTGG 

GAGCGAATGCTGCGCGAGGGAAACCTGGATTCCCGGCAGCGCACGCGCATCGAGCGCGTGTTG 

TCGCTAGAGGCGTGCGGGGCCGAAGTCCAGACGGCTGCGGTCGACTTGGGCGATGGCCATCGC 

TTGGCCGATGTGTTCCGCGAAGCACGGGGCCGATTCGGCGCCATCGCGGGCGTGATTCACTCG 

GCGGGGATTCCGGGACACGTCCACTCGATCGACGAGCTGGTGCGCGTCCGCGACGAAGCCCAA 

TTCACCGCGAAGGTTCGAGGGCTGCACCACCTGGCCGAGGTCGTCGATCCGCTGAACCTCGAC 

TTTTGTCTGCTGTTCTCCTCGCTCTCGACCGTCCTCGGCGGGCTCGGCTACGGCGCCTATGCA 

GCGGCCAACGCCTACATGGACAGCTTCGCCCGCCGCCACGATCGGCCGGACGAATGTCGTTGG 

ATCGCGGTCAACTGGGACGCCTGGCTGTTCGAAGCCAAGACGTCGTCGGTCGGCGCCGAATTG 

GCGCGCCTGGCGATCGTGCCCGAGGACGCTCCGGCCCTGTTCGCGCGGGTGCTAGAGCGACTT 

CCGCAATCGTTCATCGTGTCCACCGCCGACCTTCGGGCCCGCATCGACACTTGGATCCGGGAC 

AAGAACCGCGTCCCGCCCGCCGAGATCCGAGCGGTTCAACCGCGACCGGACCTGAGCCAGGCG 

TACGCCCCGCCGATCGGCCCGCTGGAGATTCAACTCTGCGGGCTGGTCTCCGCCTATTGCCGG 

TTCGACCGGATCGGGCGGGACGATTCCTTCTTCGAAATCGGCCTCAGCTCGTTCGACTTGATC 

CAGCTCAGCTCGCGCATTCACCGCATCACCGGCAAGGATCTCAATACGACCCAACTGTTCAGC 

TACCCCACCGTGCGCGCCTTGGCGCTCTTCCTCGGCGGCGAACCGGAGGGGCTCGCGGCGGAG 

GAGCCCGCCATGGAGAACCTGTGGCTGCAACGAAGCGATGCGACCCTCGATGAG 

(2) peptide sequence 

Seq ID No 64 £ >pEPOcos6_ORF12 . pep) 

MTVEHETGFEIAVIGLACRVPGAADVAAFWRNLVEAKESVRFFEDHELRAAGVPEEILRLPNY 
VKAKPLLADGEAFDADFFGFHPREAAYLDPQVRLLHECCWTALEDAGYDPAQYAYPIGLFAGV 
SSNLSFLFDRIDPRDSPLQKRYVAELNAASFATQIAYRLDLKGPAISIQTACSTSLVAIHLAA 

Q S L I GGE CHMALAGGATLEVPKKPG YLYREGY INSPDGHCRAFDADAAGT I FGDG VG I VLLKR 
YRDALRDGDHVY AVI KGS AINSDGHRKVS YTAPGKSGQVAVI RAALAAAO VE POT I RFVEAHG 
TGTLAGDPIEVEALTEVFAEAGRGTCALGSVKTNIGHLDVAAGVAGFIKAVLALERRVLPPSL 
HFVRPNPAIDFNGPFYVCRQIERLTENGRLRAGVSSFGIGGTNAHVILEEAPAPEARLPAGSP 
PGASPFLFPLSAKTPDALAGRCHDLADHLRAHPELLLADVALTLQMGRASFAYRHWQAATAE 
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elirgi/safrqesirkrr1jrvqwlageamsldaglrlyadwpvyrervdvclaivaklrqid 

grsflhewierprevpaewstalarmfhcalaqaiisqaglhpqrmwsrglggqvgvvlaesls 

leoalalvlcqtpvpgdatpqrer1vrtlegcrfrpprflisadssgrpldlaefahvdfwcg 

gqsaspneaelrswsdaapelvtiaigpsfleaasgtvglaidpkrpmtcvqrtvaalwewgc 

dvrwaaftsstgrrvplptypfvrviptigdplrgagaeddliaasasasagsppepsansaa 

erpraqssiasattpapshtsasvavatiletvrayfgfaavrstdaffelgassldlvnlgq 

llsdrlgrevptlllydhptpdqlalaltsaalsaeapplrgghrastsgtaassaastaptf 

pgdahsqpsfvreqdiaiigmafrgpgaddldafwnnlvegvesitffsedellaagvprehl 

astryvrakgeltgmmdfepeffgi'sareaavmdpqfrvfhecswhalehggydptrcaasig 

vyagvtnhlpwlmrtlphlteeeqrgallltdrefpapllsykvglrgpaislqtacstslva 

i gtacrelragacqmalaggvtas - srcgyfhqegyi ls pdghtrs fdaaaagtvfgdgvgmv 

llkplaqaladgdtihavikgigixitogarkvgftapsragqteairaalrdagvasnrvsyv 

eahgtatrmgdpievealtqafraeadgplppgscllgsvksnvghlnaaagvaglvktvlal 

qhrrlpts lfyqs pnph i dfaas ? ? rvngqtsdwvapegtrllagvs s fgiggtnahl i veea 

pkalpttaaplstepltoldagdadglvlpisartptalahiatotianhlerhptialadvalt 

lqlgrrqwphrhslicrnrteaikllrawhsaevppaqapvsdaprcvflfpgqgaqypsma 

rdlvrncpdfalhldpcldqlaellpedprcilfgdgpadrldqtaytqpllfsvsyalarwl 

gdfgirpdamighslgeyvaaclaglfslsdalllvsergrlmgsaargamlavplpewelee 

rlelladdrisiaavntaescviagpseaiercaqrwaaqgltctplrtshafhsammepive 

p fgkvlarvtf ap prar w i snldgx? i ds aavmqpd ywvrhlrqp vrfheglshllaedthaw 

vevgpgrtlssfvrrhpayrhqpi'.'npmrhavestgdvrrwrqalgelwragmpvawerqrrg 

rhagrrvplpgypferrpfaarrf'.^elaqpapkaelvknpdparwlyrrvwrpaqaaagglav 

qatvlvfgdgselcraavaqvqrq3lkcvsitagrqfaresdmrftldpadprqldqlfaald 

gsgsrpryvlhlltlnpppdasa::ahsyyspmalahalgaheiapvsitvvtagvvavadea 

ir£?lqalivgpclvipqefpglsvrlldvnvddpaprlaerlvaelsgtdhmvalrggerlv 

advdqvdglgvgiakvplrreghy1ilgglgdigyhcarylaqtyrakltltarsslpprasw 

ERKLREGNLDSRQRTRIERVLSLEACGAEVQTAAVDLGDRHRLADVFREARGRFGAIAGVIHS 
AGIPGHVHSIDELVRVRDEAQFTAKVRGLHHLAEVVDPLNLDFCLLFSSLSTVLGGLGYGAYA 
AAICAYMDSFARRHDRPDECRWIA^CWDAWLFEAKTSSVGAELARLAIVPEDAPALFARVLERL 
PQS7IVSTADLRARIDTWIRDKNRVPPAEIRAVQPRPDLSQAYAPPIGPLEIQLCGLVSAYCR 
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FDRIGRDDSFFEIGLSSFDLIQLSSRIHRITGKDLNTTQLFSYPTVRALALFLGGEPEGIaAAE 
E PAMENLWLQRSDATLDE 

pEPOcos€_ORF13 sequences: 

(1) nucleotide sequence 

Seq ID No 65 (>pEPOcos6_ORF13 . seq) 

ATGAAATACGAAACCACCGGATTGGAATTGGCCGTCATCGGTCTCGCTTGCCGCTTTCCAGGC 

TCACCCGATCCCGAACAGTTCTGGTCGAATCTGCGCGCAGGTCGCTCCGGAATCCGCCATTTC 

AGCGATGCCGAGCTGAGCCACATCCCCGCATCCCTGCGTCACCATCCGCATTACGTCAAGGCC 

AAAGGCGCGCTGGACCACGCCGATTTCGAACCAGCCTTCTTCGGCTACTCGCCCAAAGAGGCC 

GAGGTGATGGACCCTCAATTCCGGCTGCTCCATGAGTGCTGCTGGGAGGCGCTGGAGTCAGGC 

GGCTATGCGCCGAGCCAATTCGCGGGTCGGATCGGCTTGTTCGCGGCGGCGGCCTTCAACGAC 

GGATGGATCGCCGGTACCCTCGACCGGCTGCGCACCGGCGTGGGTTTGAGCTCCCTGGAAACC 

GCGTTCTTGACCCTGCGCGATTACCTGACCACCCAGATCTCCTATCGGCTCGATCTGCGGGGC 

CCCAGCCTGCTTGTCCAAACCGCCTGCTCGTCGTCGCTGGTGGCGGTCCAGCTCGCCCAGCAG 

GCGCTGATCTCCGGCGAATGCGCCCTGGCCTTGGCTGGCGGCGTGTGCGCGACCGATCCGCTG 

CATTCGGGATACCTCTATGAACCCGGCAACATCTACGCGCGCGACGGCGTCTGCCGACCGTTC 

GACGAGGCAGGCGCCGGTACGGTCTTCGGCGACGGGTGCGGCATGGTCCTGCTCAAGCGGCTG 

AGCGACGCCCAGCGCGACGGCGATACGATCTGGGCGGTCATTCGCGGGGCGGGCGTGAACAAC 

GACGGGCACCACAAGGTTGGCTACACGGCTCCTGGCACGAGGGGCCAGGTGGCTTTGCTTAAA 

AGTGTTTATCGCGCGAGCCGGGTCGACCCGGCGACGCTCGGCTACCTGGAGGCCCATGGCACC 

GGCACCGCGCTCGGCGATCCAATCGAGGTCGAGGCGCTTACCCAGGCCTTCGCCAGCAAACGT 

CGCGGCACCTGCGGCTTGGGCTCGGTCAAGGGCAACCTGGGTCACCTCAACACGGCGGCCGGC 

ATCGCTGGACTGATCAAGGTGGTGCTGGCGCTGAAACATCGCGAAGTGCCACCCACCCTCAAT 

CTGCGCCGTCCCAATCCGAAAATCCGCTTCGACGAGACGCCGTTTTTCCCAGTCGTCGAGTTG 

CAACCCTGGCCAAGCGGGACCGGCCCCTTGCGAGCCGGCGTGAGCTCCTTCGGCATCGGCGGT 

ACGAACGCCCACGTCATCCTCGAGGAGGCACCGCCGACGGCCAACCCGGCGCCACACGGCAGA 

TTCCGACTGTTGCCGCTTTCGGCCAAGACACCGGCTGCGCTCGAAGCGAAGCGCCGCGATCTG 

GCC3GCTTCCTCGAACGCCACCCGGAGACCTCCTTGGCCGACCTCGCCTTTACCCTGCAACGC 
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GGCCGCGAGGTCTTCAGTCACCGCGCCTGCCTCGCCGTGGAGACCTTAACGTCCGCGCGCACG 

CGGCTGAGCGGCGAGTCGTCGAGCACTTGCGTGGTGGGCCCCGCGCCCAGCGCCATATTTCTG 

TTCCCTGGTCAAGGCAGCCAGCTCGCCGGGATGGGCCGCGGTCTGTATCACCATTTCGAGCCG 

TTCCGCACGGCCGTCGATGCCTGTCTGCGCGAGCTGGAGCCAGGACTGCGGCAAGCGCTCAGC 

GCCCATTTCGATCCGAATCGCGGCGCGGACCCACCCGATTCGACGACCTTCGTCCAACCCTTG 

TTGTTCCTCGTCGAGTACGGGGTGACCGAGTGGCTACGCTGCTTGGGTGTGCGGCCAACAATG 

GTGTTGGGTCACAGCTCTGGCGAGTATGCCGCAGCCTGCGTCGCGGGCGTTCTGTCGCCGTCC 

GCGGCGGTCTCGCTGCTGGCCGAGCGCGAGCGGCTGCTGCGCGACCTGCCAGCCGGCGCCATG 

CTCGGCGTCCCGCTGGCCGCCGAGGCGCTCGAGGCGATGTTGCCCGACGCTCTCGATCTGGCG 

GCGATCAACGGCTGTCAGCTTTGCGCCGTGTCCGGGCCGGTCGCGGCGGTCCACGCCTTCAAG 

GCCCAACTGGAAGCCGCCGGACATCACGCCCGCCTGTTGCACACCGATCGCGCCTTCCACTCG 

CGGCTGGTAGCACCGGTGCTTGACCGGTTCCAGGCAGCCGTTCAACACGTGGAGCTGCGGCGG 

CCGCAAGTACCTTACCTCTCGACCGTCAGCGGGCGATTGGAGGCGGATGGGCCGGCGAACCCG 

CACTACTGGGTGCGTCACCTGCGCGACACGGTGCGGTTTGGTCCAGCCCTGGAGGCGCTGCCG 

CCGGTGGATTCCTTCGTGTGCATCGAGGTGGGACCAGGCTCGGCCTTGAGCACCATGGCGCGC 

GAAACGTTGGGTTCCCAGGCGCGACTGATTTCGTTGCTGCCGCGGCCGCGAACGGGGCAAATC 

GAGCCCGGTCCGGTATTCGAACGACTGGCGGCGCTTTGGCGCAGCGGGTTGACATTGGATTGG 

TCTAAATTGACGGGCGGCGAAGAGGGTCATCGAATTCCCTTGCCAGTCTACCCGTTTCAGCGC 

AGCCATCTGTCGAGCTCCCTGGCGGCGGGCCACACGCCTTCGTCGCGGCCTGCAGTCGAATCA 

GGCGCCATCCTTGCCGAGCGATCCGCAGGGGAAAACGCTGAAACCCGGGATTGCCCGCTGCCA 

ACCGCCACGCTCGAGCCCAAGGCGGTCGCTCCGGCCCCACTCGAGGCTACCGACGCCGCAGGT 

ACTCGCGAGCGACTGGCCGAACTTTGGCGCGAGTTGCTAGGGTTGACCTCGATTGGGCCCGAC 

GACCATTTCTTCGACCTGGGCGGCCACTCGCTGACCGCCACGCGGCTGCGCGCCCTGATTCAC 

CAGCGGTTCGATGTCGATCTCGGGCTCGACGAAATCTTCGCTCATTCGCGTCTCTCCCAGCTG 

GCC3CCCGTATCGAGGCGGCGGCCAAGAGCCGATTTTCCTCCATTCCCAGCGCGCCGGACCAG 

GAC3ACTATCCCTTGTCATCCGCCCAGCAGCGGATTCACAGCATCGTCACGAGGGCCGAGGTC 

GGCACTGCTTATAATTTTCCGATCGTCCTCGAGCTGCAGGGCGCTCTGGATCGAGTGCGATTC 

GAGGCGACGTTCGCGGCATTGTTCCGGCGTCATGAGGGGTTCCGCACCCGCTTTGTGATGCGC 

GATGGCGGGCCGCGCCAGCGCATTGTACCGGACGTGGCGTTTCGCCTGCCGCTCACCCAGGTC 

GAGCCAGAGCAGGTTCCCGGGCGCATCGAGGCCTTCATCCGTCCCTTCGATTTGGAACGCGCG 
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CCGCTGTTCCGCGCGGAGCTGTTGCAGTTGGCCGAGCAGCGCCATCTGCTACTTTTCGACATG 

CACAACTTAATTGCCGACGGTATCTCGCTCAACCTGTTCGTCGCCGATTTCGCGGCCCTGTAC 

CATGGTCGTCCGCTGGCGCCGCTGAAACTCCGCTATCGCGACTATGCCGTTTGGCAAGAGGCG 

CGGCTGGCCTCCGATGACCTGCGCAGCCAGCGCGAATGGTGGCACCGGCGGCTTTCGCCGCCG 

GTCGCCACGCTGGCGCTCCCTCCCGATTTCCCGCGTCCGGCGGTGCGCCGCTACAAGGGCCGT 

AATGTGGTGTTCCACCTGGACCGGGAGATCCGCGACCGCCTGGTGGCCCTGGCTCGAACCCAG 

GGGGTCACCATGAACGTGATGATGCTGGCGCTCTGGGCTGCGCTGCTGCATCGCGAAACCGGC 

CAATCGGAGCTGGTGGTCGGATCGCTGCTCGGCGGGCGGCCGCACAGCGAGCTGCATCCCGTG 

ATCGGGCTCTTCACCAACTTTTTGCCCTTGCGGTTGGCGGTCGAGGGATCGACCCGCTTCGAT 

CGCTTCCTTGCCGCTTGCCACCAGGCGTTTCTCGAAGCCTATCAGCGCCAGGACTATCCGTTC 

CACTTGTTAGTCCAGGAACTCGTGCCGGTCAGGGACCCGTCGCGGTCGCCGCTGTTCCAGACC 

TCGCTCGTCTACCACAACGAAATTGACGGCAAGACCAAGCTGGAATTGGAAGGGCTGAAAGTC 

GAAGTGGTTCCCTTCGAAAAGGGTGTGGCGAGGCTGGATTTGAAGCTGGATGTGACACCTTTT 

TCCGACCGACTCGAATGTGTTTTGCAATACGACTTGGATCTGTTCTGCGAGGAGACGATGCGC 

GGCCTGATCGCGCGGTTCCAGGCGTTGGTGGCGGGGCTTGTCGCCGATCCGGCGCAATCGCTC 

GCCGCCGCGAGCGTTTCCGGGAAGCGGGCGCTGCGCGCGGGCGTGGCCACGGCAAGCGAATCG 

TCGCCGCAGTCACTGCCGCCGCAACCATCGACGGCGTACGCCACTCCCTCACCGCAGTCACCG 

TCGCCGGTAGTCCTGACGGGACCCGCCGACCTGCCCGCGATCTTGGCGGCCTACGTGGGGCAG 

AACCCCCATCCGTTCGCGATCCATCGGGGTCTCATTTTGGAGGCGCCGCTGGGGTTGCGAGCG 

CTGCGGTCGGCGCTGGACGCAGTGCTCGGAGAACACACCCATTGGCGCAGCGTGCGTGCGGGC 

GATCGCGCGCGGCGCGTGGATAAGTTGGAATTGACCAGCCTGGTGCGGCTCGACGACCTGCGC 

GGGTTGGTCAATCCTCAGGCGAATGCCTTCACCCTGGCTTGGCGCGATCTGGCGATGCCGTTC 

GGGGAGGGGCGTCCCCTGTGGCGACTCCGCCTGGCGTGGTCGGCTCCATCGCGCTGGTTGCTA 

TTGCTGACGGTTCATCCATTGATCGGCGACAACGGCACGGTCGACCTCTTTCTGGCGGCACTC 

GCCGATCACCTGCGCCGCGCGTCCGCTTTTCCCGTAGCACCGCTCGATGAGGCCGAGCTGGAG 

GCGGAGCTGAAGTGGGGAGAGGAAGGGGAGGGCCTCGGGCTGACCGCGATCGCGCCGGTCCTG 

GGCCAATTGCGCGAAAGTCGGCTGAGTCCTGTGGCCCAGATGTGGCTGGACGAGGTGTGTCGC 

CGCCACGACCTCACCCCGCTAGAGGTCTTGGCGGCCCGGCTCCTCGATTGGACACGAAGCCAC 

GGTCACGGGTCGATCGCTTTGTGGAGGCCGCTGCCCGAGGACCATCCGCTTCGCGATGAAGGC 

CGCTGCCTCCAGGTTCGCCTGCTGGAGGGGCCGCCGTCGCAGCGAGGAGCGGGCGATCCAAGC 
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TGGCTCGAGCAAATCGCCTTGAGACGGGGTACCCCTGCAACGGAGGTCGTTTGCCCTACTCCG 
ACCCAACGGGCAGCCATCGACCTCGCGCTGGCCTGGCTGCCGCAGCCGCCTCTTCACGGTTTG 
GTCGGAACCGTTCAGCCGTGGCCGGAATCTCCATTGGTCTGTCCGTTTCCCCTCAATCTCGCG 
TTCCGGCCAAGCCATCCAATTGCCTACGCGCTCAAGCACGAGGCCACGCTCGCGGTCACGGCA 
CGGGCGCGCGATCTGATGCGTTTCCTCGACGGCTTGGGCCCGGAAAGC 

(2) peptide sequence 

Seq ID No 66 { >pEPOcos6_ORF13 . pep) 

MKYETTGLELAVIGLACRFPGSPDPEQFWSNLRAGRSGIRHFSDAELSHIPASLRHHPHYVKA 

KGALDHADFEPAFFGYSPKEAEVMDPQFRLLHECCWEALESGGYAPSQFAGRIGLFAAAAFND 

GWIAGTLDRLRTGVGLSSLETAFLTLRDYLTTQISYRLDLRGPSLLVQTACSSSLVAVQLAQQ 

ALISGECALALAGGVCATDPLHSGYLYEPGNIYARDGVCRPFDEAGAGTVFGDGCGMVLLKRL 

SDAQRDGDTIWAVIRGAGVNNDGHKKVGYTAPGTRGQVALLKSVYRASRVDPATLGYLEAPIGT 

GTALGDPI EVEALTQAFASKRRGTCGLGSVKGNLGHLNTAAGI AGLI KWLALKHREVPPTLN 

LRRPNPKIRFDETPFFPWELQPWPSGTGPLRAGVSSFGIGGTNAHVILEEAPPTANPAPHGR 

FRLLPLSAKTPAALEAKRRDLAGFLERHPETSLADLAFTLQRGREVFSHRACLAVETLTSART 

RLSGESSSTCWGPAPSAIFLFPGQGSQLAGMGRGLYHHFEPFRTATOACLRELEPGLRQALS 

AHFDPNRGADPPDSTTFVQPLLFLVEYGVTEWLRCLGVRPTMVLGHSSGEYAAACVAGVljSPS 

AAVSLIAERERLLRDLPAGAMLGVPLAAEALEAMLPDALDLiAAINGCQLCAVSGPVAAVHAFK 

AQLEAAGHHARLLHTDRAFHSRLVAFVLDRFQAAVQHVELRRPQVPYLSTVSGRLEADGPANP 

HYWVRHLRDTVRFGPALEALPPVDSFVCIEVGPGSALSTMARETLGSQARLISLLPRPRTGQI 

EPGPVFERLAALWRSGLTLDWSKLTGGEEGHRIPLPVYPFQRSHLSSSLAAGHTPSSRPAVES 

GAI LAERSAGENAETRDCPLPTATLEPKAVAPAPLEATDAAGTRERLAELWRELLGLTS IGPD 

DHFFDLGGHSLTATRLRALIHQRFDVDLGLDEIFAHSRLSQLAARIEAAAKSRFSSIPSAPDQ 

DDYPLSSAQQRIHSIVTRAEVGTAYNFPIVLELQGALDRVRFEATFAALFRRHEGFRTRFVMR 

DGGPRQRIVPDVAFRLPLTQVEPEQVPGRIEAFIRPFDLERAPLFRAELLQLAEQRHLLLFDM 

HNLIADGI5LNLFVADFAALYHGRPLAPLKLRYRDYAVVJQEARLASDDLRSQREWWHRRLSPP 

VATIALPPDFPRPAVRRYKGRWVFHLDREIRDRLVALARTQGVTMhTVMMLALWAALLHRETG 

QS ELWGS LLGGRPHSELHP VI GLFTNFLPLRLAVEGSTRFDRFLAACHQVFLEA YQRQDY P F 

HLLVQELVPVRDPSRSPLFQTSLVYHNEIDGKTKLELEGLKVEWPFEKGVARLDLKLDVTPF 
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SDRLECVLQYDLDLFCEETMRGLIARFQALVAGLVADPAQSLAAASVSGKRALRAGVATASES 
SPQSLPPQPSTAYATPSPQSPSPWLTGPADLPAILAAYVGQNPHPFAIHRGLILEAPLGLRA 
LRSALDAVLGEHTHWRSVRAGDRARRVDKLELTSLVRLDDLRGLVNPQANAFTLiAWRDLAMPF 
GEGRPLWRLRIAWSAPSRWLLLLTVHPLIGDNGTVDLFLAALADHLRRASAFPVAPLDEAELE 
AELKWGEEGEGLGLTAIAPVLGQLRESRLSPVAQMWLDEVCRRHDLTPLEVLAARLLDWTRSH 
GHGSIALWTPL.PEDHPLRDEGRCLQVRLLEGPPSQRGAGDPSWLEQIALRRGTPATEWCPTP 
TQRAAI DLALAWLPQP PLHGL VGTVQPWPES PL VC PFPLNLAFR PSHP I AYALKHEATLA VTA 
RARDLMRFLDGL.GPES 

pEP0cos60RF13 . 1 sequences : 

(1) nucleotide sequence 

Seq ID No 67 ( >pEPOcos6_ORF13 . 1 . seq) 

ATGACGCAAGCCTCGGCCGCGTCGACGTCCCAGGTCGCGCCGGAGGTCACCCCCGGCCGAAAG 
GACGACGATGACGATCAAATCCGAGATGTCGGCCGTTGCTCACTCTGCGGAGAGCGGCTTCCG 
CGCTGGGCCACGCGTGGGCGGCGCGATGAAGCGGGGCCGGACGCCGGAGCAGGCCGGCGTGAA 
GCTGCTCCGCGCCCCGGTGAAGCGGAAGTGGCTGCCCCCGGCGCCCGTCCTGCGCCTGAGCGA 
GCGGCGTATCCCGGAGGTGTGGGCAGGCTACCGCGCGAGCGCGGGATGACCCGAGCCCCGCCC 
GCCGGCGCGACCATGACGCCGCCCCACGGGGCGAGTCGTCCGGCGCGCCGGCGCGCGTCGGGG 
CTTCCGCCGCCGGGCGGGCAGGTGCAGGATGGTCGGGCATGG 

(2) peptide sequence 

Seq ID No 68 { >pEPOcos6_ORF13 . 1 .pep) 

MTQASAASTSQVAPEVTPGRKDDDDDQIRDVGRCSLCGERLPRWATRGRRDEAGPDAGAGRRE 
AAPRPGEAEVAAPGARPAPERAAYPGGVGRLPRERGMTRAPPAGATMTPPHGASRPARRRASG 

LPPPGGQVQDGRAW 
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pEPOcos6_ORF14 sequences: 

(1) nucleotide sequence 

Seq ID No 69 ( >pEPOcos6_ORF14 . seq) 

ATGGTGACGCGTCCGACGTCCGACGGCATCGAGGACGAGCTCGCGCCGTTCCCCCCGGTCCTG 

CGCGGCTGGCTCATCGAGGGCGAGCTCGGCCGCGGCGGGATGGGGCGGGTGTTCCGGGCGCGG 

CACCCGAAGACGCGGGCGCGGGCGC-CGATCAAGGTGCTGCTCGGCGACTACGCCCGCCGGCCG 

GACGTGGTGGCCCGCTTCCGGCAGGAGGCGATCGCCGTCAACATCATCAACCACCCGGGAATC 

GTCCGCGTCTTCGACTCCGGCGAGCTCGAGGACGGCTCGCCCTACATCGTGATGGAGTACCTG 

GACGGCCGGGGGCTGCGCGACTGGGTGCAGGCCGTGCCGCCCGCGGAGCGGCCGCGGCAGGTC 

GTGCGGCTCGGCTACCAGATCGCCTCGGCCATGGCCGCGGCGCACGCGTCCAAGGTCGTCCAC 

CGCGATCTGAAGCCGGAGAACATCATGGTGGTCGAGGACGAGCTCGCGCCCGGGGGCAGCCGC 

GTCAAGATCCTCGATTTCGGCATCGCGAAGGTCCTCTGGGGAGGTCTGCCCGAGGTGCTGGAG 

CTCGAGGGGAGAGGCTCCCTCGCGCCCGCGTCCGCGTCCACGATCCGCACCGAGCTCTCGACG 

CGGCCGGCGCCGACGGTGGGCGCCACGACCGGCCCAGAGAGCCCGCTGGGCGCGAGCGCCACG 

CCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCA 

GAGAGCGAGGCCCACGAGGAAGACGCGCTCCGGAGCCTCCCCGTCGTGACCAGCGGCAGGCCC 

GCGATCCACCCCGCGCCGGTCGAGATCCCGCCCGAGGCGGTCTCCTCCGCGGCGTCGCGCGGG 

TCGCGCGCGTCGATCGAGCCAGGCGCGCCCGCGCCGCAGAGCGAGGGCGCGGGACAGCCCACG 

ATGCCGTTCACGCAAGAGGGCGTGTGGGGCCTCGGGACGAGGAGCTACATGGCGCCGGAGCAC- 

GAGCGCCACTCCGGGAGCGTGGACGTGAAGGCGGATGTCTACTCGCTCGGCGTCATCCTCTAT 

GAGCTGCTCGAGGGGCGGACGCCCGACGCGCCGAGCGCCGCGTGGCCGCCCCCGATGAGCGCC 

GCCACGCCGCCCGATCTCGTCGCCCTCGTCCACCGGGTTCTGGCGTTCGATCCCGATGCGCGG 

CCGCGCATGGCGGAGGTGGCGAGCGCGCTTCACCGGCTCGGCCGGGCGAAGAAGGAGCTCGAC 

GAGGCGCTCTCGAGGTGGGTCGTCGGCGGAGGGGCGCCGGGGCTCTTGCCGTGCGGCTATGCT 

CTTCTCGAACTGGTCCTCCTGGGCCCTGGGAACTTATACGATTCTTTCCAGCCTGTAAGTGCA 

TTTTTCTTTCAATATCGTCCTCTCTTCATATACGAGGTGAGTTCTCTGAGGTCCTCCTATAAG 

TCTGGGGTGTCCTATTCGGCCTCTTACTTGTTACTTCGCCTTCTTAGGAGTTTTTCCTTAATT 

TTGCCCTCTTACATTCCCGTATTCATTCTAACTGGGCCCTATCTCATTCGC 
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(2) peptide sequence 

Seq ID No 70 ( >pEPOcos6_ORF14 . pep) 

MVTRPTSDGI EDELAP? P P VLRGWL I EGELGRGGMGR VFRARHP KTRARAAI KVLLGDYARRP 
DWARFRQEAIAVNIINHPGIVRVFDSGELEDGSPYIVMEYLDGRGLRDWVQAVPPAERPRQV 

VRLG YQ I AS AMAAAHAS KWHRDL KP EN I MWEDELAPGG S RVKI LD FG I AKVL WGGLPEVLE 
LEGRGSLAPASASTIRTELSTRPAPTVGATTGPESPLGASATPESALGASATPESALGASATP 
ESEAHEEDALRSLPWTSGRPAIHPAPVEIPPEAVSSAASRGSRASIEPGAPAPQSEGAGQPT 
MPFTQEGVWGLGTRSYMAPEQERHSGSVDVKADVYSLGVILYELLEGRTPDAPSAAWPPPMSA 
ATPPDLVALVHRVIAFDPDARPRMAEVASALHRLGRAKKELDEALSRWVVGGGAPGLLPCGYA 
LLELVLLGPGNLYDSFQPVSAFFFQYRPLFIYEVSSLRSSYKSGVSYSASYLLLRLLRSFSLI 

LPSYI PVFI LTGPYLIR , 

or DNA sequences complementary to said open reading frames, 

(b) DNA- sequences which hybridise under stringent conditions 
to regions of DNA sequences according to (a) encoding proteins 
or to fragments of said DNA sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

10. DNA sequence according to any of claims 1 to 5, wherein 
the DNA is selected from the group consisting of 
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(a) che following DNA Sequence: 



Seq ID No 71 (>Contig4 3) 

CGGGTATTTGTGATATGTGGGCNGTAGTCGTATGCTTCATTAAGTACATC 
5 CGTCCGTNGTAGAGAGTGACTCTGTCGCAGCGATAATAGACACGCTTGTG 

15 ATGCTATAGGGAACATAGAGTCNTAGTAGATGATACGACGAGATATTNGT 

ATAGAGCGTATAGACCGACGTGTGAGCGTCATAAGTGTTGTGTGTCATGA 
GTGTGCTCAGAGGACGTGCAGACATTATATGAGCAGATGATGAGAGAGAA 
TCAATGCTGCAAGNTATTCGTCGAATCTACATTATATCGAATCGTGTATG 

20 10 TGC3TTTGTCGCAGCGCGATNCGATGAGATACCGAAAGGGTATGTATCTA 

TNTTCGTGACGCTCGATNAGAGCAAATCCGCTACCGTGGAGATATCGTGT 
ATCGACTCCATCACGATCAGTATCATGATACGTCAAACGAGTACACTCAT 
TA rr TGATAACACACGTANGTGTGCATGCACAGTTATCGAGTGTATTGTGT 

25 

GCATGAGAGGTATAGGATNTATAGGCGAGCATATATATCTATATATATAG 
1 5 GTTAAGAGTAGAANACTATGAAGATGCAGGAAGTAGTATCTCGCGGACAA 
ACGGNGTACCTAGCGGGGTTGAAGTATTATCGACAGTGTATAACGACTCA 
30 ACAC-GGNTACGAGGTACATTGTATTTACAGTGGTTGGAAGGATTGCGCGA 

GGAAAGGTAGTGGTACCGTGTGAGCTACGATGCTCGGGATAATGGTGATT 
AGATAGAACCTTAGCGTTGCTAGATGAGTGAGTGGTGGTATGAGTAGAGT 
2 0 TTTTGTTCTAGCTTTGTGTCCAGCGAGGATTCGTTCAGTCTGAAGGGTAA 
35 GAGTACGTCCATCGCACACCCGACCGTTTTGAGGAGTTCTCGGTGCGTGG 

TCAGTGGGGTTTGGAGAAGACAGAGTTGATTCATAGGGTTATCAAACGAG 
TTATGTGGATAGATGGT AGTGAC C CC ATTTG AGTGAGAGTGTTGGCGTT A 
ACANCAGCAGGATNTAT 



25 



' SEQ ID No 72 (>Contig44; 

TAGGTCTTTGACACCATGGGAGCTGCTACCGATGTTGCCGAGCACGATCG 

45 CGC2GGCGCCX3ACGAGCGACTGCAAGCCGGCCGCGGCCATTTACGCCTGA 

CGAGCGAGGTGC3GCGAAGTGCTGGTGCGCGCCGTGCGTGTCGAGCGCGCC 

30 CAGGTCCGCCGTTGCGCCGTCGCCGAGCAGTAGCGCGCCGTCGAAGACGA 

50 



55 
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TCACCGCGATCGAGGTCAGCGTCGTGGGGGCGAGGCCGAGGAGCGCGAGG 
ATGCCGAGCACGACGCCGGCCGCGCCGTAGACGAGCTTGGCGCCCATGCC 
GCCGCCGAGCTCGGTGCGCGTGTCCCACTCGACGGGCGGCGCGGTGCGGC 
TCAACGCGCCGAAGCGCGAGGCGATCGCGCCGCCCTGCGCGATCAGCGCG 
5 GCGCCGAATACGATCGTGGCGATCTGAGGTGAGCTCTACTGGCATGATCC 
CCGTCAGCCCGAGGATGGTGAGGACAATCGTCGCGGCGCCGCACAGTACC 
TCGCACGAGCGAGCCTCCGAGCACGACCTTCGGCGTCGTCTCGTCCTTTG 
GTCTGCGTCGCGCGCCCGAGTGCGGCGTTATGTGGCTCTCCGGCTGTGCA 
AACCGTTCACGTTCTTCCGGTCCTGGAGTCAGCATCGGCATGATTCCCCC 
10 GTCCTGCGGTGAGGCCTTGTCGCGCTCACGCGCGCTCCGACTTGCACGTG 
CTGTGCCGGGTTCTCTCGCTCAGGAGGCGCCTCTCTTGGTGGTGCTTGCG 
TCCTGGTCCGTTTGCCCGCCTGTGCGGTAGGTTTCTTGAACCAGGTGACC 
TTCAGGGACCCCTTGATGCGCTCCATCGTGTCCTATGTCGATCCTTCTCT 
GACTTGTATGGGTCTCGAACCAACTACGCTTGATCAGGCCTTCGAAGGGT 
1 5 CCTTTGGGAGATCGACTCTGGATCCATACCGGGAGCCCCTGTTCTGCCGC 
TCTCTTAAGTTTCCCCTTCTGTATCCGTGTCGACCGGAAACGCTTTATCT 
CTAATGCGCTCTAATTGCGTCTCTGCCACACGTGCGCTTCACTCTGGATC 
TACTTCTTCTCCCTAGTCTTCTACCTCCGTACCCTTATTTGTTGGTTCTA 
TTTATTTCTTTTCGCTTCACCTCGCGTCATTGTCGCCTAGTGTTCCTCCC 
2 0 TCATATCGCCTTTGGTCTCCCTCGAGCGTACAGTCCTCTCTCTTCAGATG 
CTTTCCGGCTCCTCTTCTGCTGGCCCCTTATCCTTTCTAATACTTC 

SEQ ID NO 73 (>Contig48) 

ATGCGCCCAGGAACACCCCGGTGCGGCTGCCGTCGAGGGACTGGGGTGCG 
2 5 ATGCCGGCGTCCTCGAGCCCTTCCCAGGTGACCTCCAGCAGCAGGCGTTG 
CTGAGGATCGAGCGACCGCGCCTCCCGAGGCGAGGTGCCAAAGAACGCGG 
CGTCGAAGCCGTCCACCGCCTCGGTGAGCAGTCCGGCCCAGCGCGGCACC 
TCCTCGCTGGGATGGACGCCGACCAGCGCCCAGCGCCGGTCGAGCGGCTG 
GACCGCGTGTCGGCCTGAGTCGAGCAGCTCCCAGAATGCCTCCGGAGTGT 
30 CCGCTCCGCCGGGGAAGCGGCAGCCAATGCCTACGATGGCGATCGGCTCG 
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GTCCGCTCTTGCTCCAAAGACGCGTTCTTTTTCGCAAGCTTGTCCATGAG 
CAGAAGGGCATGCTCAAGCTTCCCGGCATTCGTGGTCGCCATACTCCCTC 
GGTCCCTTACTCACCAACGATCTGCGCGAGCTGCGCCAGCTTTTCGGCGA 
GCAACGCGTCCTTCTGCTCGTCCGTCATGCCCCGCAGAGCCTCGAGATCT 
5 GCGGCATCGTTCTCGAAGCTCTTCTCCCGCTCGGTGGCCGGAGCGTGGGT 
CGCGCCGGCATTCGGAAACAGAATGTCTAGCAAGCTCCCGCTCAGAGCTG 
CTACGTTAGGGTAGGTCCATAGCAGGGTCGCCGGCACGGTGATGCCGAGC 
GCGGCCTCGATGCGGTTGCGGAGCTCCAGGCCTATCAGCGAGTCCATGCC 
GAGATTGCTGAACGGCACGTGCCGCTCGATCCTCTCCGGCGGAAGGCGCA 
20 1 0 GCCCCCGCCCCAACAGCTCGCTCAAGTGCTTCTCCAGAATCAACTGACGA 

TCTTCGGGCCTGGCGCTCTGCAGCGCCTCGCGCAGGTTCGACGCGTTCGA 
' CGCGCCTCGGTCGGCGCGGTCACGCTCCTTCAGCAGCTCCGCCCACAGCG 
CCAATCGGGCCGCGTTGGGATAGAACTC 

25 

15 SEQ ID No 74 (>Contig4 9) 

ACCACCGCTTCACTCAGTATGTACTTTGTTATACTCGTCTTAGTACAATG 

20 ATATAATACTCATGTGTATTCTTAATCTCGGGGAGANAAAATTGGAATAC 

TGGACACCGTTGCCGCATGCNGACTCTAGAGATCCCCCTGCGACGGTATC 
CCACGGCACCGGTATGGCCGGCGCGCGCTCCGGGGGTCAACGCCCCGTGG 
20 TTGCCTTCACGACAACGCCGGTCGGGCGGGGCGCCGTTCGATGCCGCGGG 

35 CCCGCGCGCGGCGGCGCGTTATCCTGTGGAGCATCTGGAGGGCGCTCACG 

CACCTGTCAGTCTAGTTCTGGCCCGCCCGGAAGGAGTCCGGGAGGCCGAA 
GTTGAACCCGATGTAGAGCGCGATGAACGACGGGAGCACGCGCGCGGGGA 
TGTGCAGCGCGGCGCCGATCGGCGTCGCGAACAGGACGAGCTCGCCCGGC 

40 

2 5 ATGCCGGGCACGACATACCCGAGCAGAAACACGATCGGCACCACGAGCGT 

GAGCTCGAGCAGCGATATTTCATGACCGACCGCGCGGGCGGCGGCGCCCG 
CCATGACGAACACGCAGATCAACGTGCCGTTGACGTTGAGCCAACCGCCG 
45 AGTCCCACCACGAAGAGCCTGAGCTCCTGCGGCACCGCCGGATAACATTT 

GCGGACGAGGTGCAGGTTGAGCGGCGTCGCCAGCGCCTCGCTGCACGAGG 

3 0 CCCACAGCAGCGGATAGACCTTGAGCCAGTAGTTGACGAAATAGTCGCGC 

50 



55 



10 



20 



25 



35 



40 



WO 00/22139 PCT/US99/23535 

145 



AGCGAGAACTCCGGGGCGGCGGCC7TCATCCGCAGCAGGCTCGCCGCATG 
GAAGACGAGGCAGGCAGCGCCGACGACCCCGGACACGAGCAGATAGGAGA 
GCATGAGGTCTCCGGCGCCGAGCGAGGGCGCCCCCGCCGAGGCGCGCGCG 
AGGTCCCCCCCGTGCACCTGCGCGGCGAGCTGCGCGGGCAGCCCGCGGAG 
5 ATAGGCGCCGAGCCCGAACATGAAGAGCGGGACCAGGCACTGCACGGCGC 
15 CTCCCGCGCGCTCCAGCGCGTCCGC3GCGCGCTCCAGCGCGCGGGCGACC 

CGCGGCGCGCGTACGGCCGCGAACGACGTCACGATGCCGGCGTAAAGGGC 
GAGGAAGCACGGGCTCGAGATGACCAGGCCCGAGGCGCTATAGAGGGTGC 
GCGCGGCCTCGAACGGCGCGCCCG7GCTGTGGCTGGGGAGCAGCGGCAGC 
1 0 CCGAACACGAGCCATGTGACGACG.-. CCCCGAACAGGCACGCTGCCAGACG 
CTTGAGGGCGAGCCAGCCCATGA"7ACGCGAGCAGCCGCCCCGGGCGCC 
CTTGCCGGTGCAGGCTCACGAAGGTCGGCACGAGGACGACGAAGATGACG 
ACCGGCGCCAGCGTGGTGTACCAATGCAGGAGACCGTCCATGGCGCGGGT 
CGACCACCGCGTGACGCTGGTCTCTCTGTCTGACTCGATCATGGCCCATT 
15 CGCCTAAAACTAATGATCCGTTCTCAAATTGGTCAAAAAAAAGTTCCCTT 
AAGACTGTTTTACTCCGGAATATTAATATATTTCTGAGTGTGAGGTGATG 
30 TTAATCACACATTCTGATATTCTCAAGGGGAATCCGTGTCATTGTGAATA 

CTTCTCTCTCTACAAGAGAGGTTATATATGGTCTCGAATATCTCGTCCGC 
TCTTATATATATTCTCTTGTGATAATATATATCGAGTGTGGGTACTCAGC 
2 0 TCTCTTGGTGTAATCTATAACTCG3CATCTCTCATAATACCTTATATATA 
CACACTCTCTCGGTCATATCTCGCATAATAGATATATTTTATATGTTCCG 
CGTTTTATCCGAGTGGGATACACTTTTTCTATATTTTCTTTGGTGTGACG 
CGTGGCGTCGAGCCTTATTATTGATTTGGTAGTCACGATATTCTCTAGAT 
GACATCATACAGATGCTCATAACTCGATAAACACAGGTCGTACACGACGA 
2 5 GACTCTCACTCTCACTCTT 



SEQ ID No 75 (>Contig5C 
45 TCCCCAGTTTCTCCTCTCTACGC:CCACATCTCAGCAGGAAAAAANATAAT 

GGAGAATCGTTGCGCTCTAGCAGC.-.TCTATAGGATCCCCGCTGCTCTTCT 
30 TCATGCACCTCGTGGAGCAGAAGTTCATCAACGCCTTCGCGATCATCGTG 
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GCGGTGAGCTTCCTGGCGTTGCTCCTGTCGCTCGTCGTCGCCGACGTCGC 
GACGCGGAACACGTTCCCGCCCGCGCCTTTGCCGGCGCTGAGCCCGCCGG 

10 

CGCCGGCGCTGATCCCGCCGGTGCCGGGCGGATCCGTTGGGCCGTCGCCG 
GAGCCGCTGTCGGTCGGCCGGTGATCGGTTGTGCGGGCGCCGTGCCTCGG 
5 GCTTACTACCCCCTCTCGCGGGTGGGGATATGGCCGTGGATGAGGGAGGC 

15 GATGAAAATCGTGATCGCCACGTGCGCGTTGTTCTAGATCGTCCCAGGCT 

GACCGTCGGGAGCGCCCAGCACGAGATGAAGAGCCACACCGCGAGGACCG 
TG77GAGGTACCGCACCGCAGGGGCGAGCATGGCGGTGATCGCGAAGATC 
ATGCAGAGCAGCCCGAGCACCCATGTGTTCGTCCGCTGCGCGTGGCTGTG 

20 10 CGGCCAGATGACGGCCGAGATGAGGAGCCAGAACCCGAGGACGACGTTCA 

CGATGCGCGCCATGAGATTGCCAGCTCGAACCATGCTCCCTCCCACCTCC 
GATCATGGGACCGATCGGGTCGCCACGGATCGATAACGGGCGTCAGGAGA 
CCGTCAATCGGCGAGCTCGTGAGCCATGGCGACAG'CCCGCCGACCGCGCC 

25 

GGCGGGTCTCTGGCCTGCTGGTCGCCGTGCCGGCGGCGGCGATGGGCCTG 
15 CCTCCGGTCGGGCCGCGCGGGCGCGCGGCGCTCCGCGCGAAGCTGGAGAC 
GCGGGACATCGCTCCCTCGCCCCGCGCTCGGACGAGCGGCGCGAGCCACT 

30 TCTCGACGGCCGAGCGGGCACTAAGCTTCCGTCATGAGGCTCGGCGCACG 

GCTCACCACGCACACGTTCTCGGCCGGCGCCGCCGGCATCAGCTTCGTCG 
TCCAGCCGATCCCGGGCTCGGACCAGCTGTTCGTCATTCCGATCCAGTAC 
20 CTGCTCGCGGCGTCGCTCGCGAAGGAGOGAGGCGCGCCGCTCTCGAAGGC 

35 GGCGTGGTCCCAGGTCCACCAGCTCATCTGGGGCGGCGGCGCGCTTCGCC 

TCATGCTCGGCTTGACCCTAGGGCTGATCCCGCTGGCCGGCGCGTTCACG 
AACGCGATGACGGCGTTCCTCACGACCGAATATCTCGGGTACTACGTGGA 
TAGAGCCCTCGACAACCCGGACAATCCGCCTCCGGCCCTGTCGATCCAGG 

40 

2 5 ATGTCTTGGACGCCATCACCTCCTTCTTCACCGGGCGAGCGCGGTAGGCG 
AGCGGTCCCTGGGTCGAGCCCACCCTGCGGCTCTAGGAGCCGAAGGGCGA 
GCTCCTCGGGAGCGGCGCGGCGTCACCACCAGATTCGCCGGCGCTTGCGG 
45 CCG3AGCGTATCGCGACCGCCGCCACCGCCGCCACGGCGAGCACGGTGAC 

CGC3GCGGCCGCCGCGATGGCGACGCTCCGGGCCGTGTGCTCGGCCTCGC 
30 CC77GATGCCGCCCCACCTCGCCTTGACCGTGACCAGGTCGCGCGCGAGC 
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CGCTCCTTGCTGTGCTCGATCTGCTCGGCAGGCCCCGCGGGCGCCACGAA 
ACCTGCGCCGGCGCTGACGTGACCTCGCTCGCTGAGGGGACTGGCGACCC 

10 

TCTCCGTGTCGAACCGGATTGCGCTGGATGGATCCATATGTCCCGCGCTG 
CAATCGTTGCTCCCCGCCGGCACATCGGAGTGCTCGCCGGATCGCGCGGC 
5 AGCGCCGACGCCGTACTTCCATAGGATAGCCCACCCCATCGGACAAGCCG 

15 GCTCCTGACGGCGGGCACCGAATGTTCGCCAGACGGGCACAAGGCGCACG 

CCGCGGACGGATCGGCCGCACTGGCACTCCAGAGCGCATCGACGGATGGC 
CGACGGATGTGCAATGAGGCGCCCGCACGAAGCGAATTGTCCCGAATACA 
GCGAAGAAATCTATAGCGATGCGAGCAGAAGGATATGTCTATGGGGGGCA 

20 1 0 GTCAGAAACTGGGGACAGTCAACGCACATATTCTCTCCAANTGCTAACGA 

CAGCGTGCGCAGAGGAAGTATCCTACTAGTGTAAGAGGGACATTCGATGC 
GACCGCATAAACATTCAGTCTACAACGCGTGAGAGGATGGAACACCCCGC 
CCCTCTGAAGGCTAGACAACCATGAATATGTGCAGAGGAAACACAGAATT 

25 

CCAAAGGTGAGAACATATGTAGGATCGCGCCACCCGAGATTGAGTGAAGA 
1 5 TATACATATATACTTATATGGATCTACAACATGGCGAACCGAACGTAGCA 
NAATAGTAGATATAATTGTAATACTGAGCTACCGACAGAAAGATACACAC 
30 GAGTGTACACACATCACACGCAGAGTGGTACCAAATTCACACCATGCGAG 

CCACAATGTGACACGGAGGAGCACAGCATGGGCGCCACTATGGAGGAGAA 
ACTACTGCAACCCACATCTGATGGACTGACCGCACGGACGGGACGTGTCT 
2 0 ATACATACAGATACATCNGATGGAGGAAGATGCATGTGCGATGATATCAT 
35 CGTCGCAAACTCATATGTCGAAGAAGATATGNGTCAACTCAGCACTACTC 

ACACGATACGTGAACAGGAGTGACTAGGACATCNCATGGTGTGTCGGCGC 
GTGCACGTGATATCAAACTCTCTGATCAACCACACACTATATAAGGAGTA 
TCGAGCGGCGATGGAACACCCCCTCACAGCATACGTATATGCACAACGTC 

40 

2 5 TGAACACTCTNGAGACACAGTGGAAGG 

SEQ ID No 76 (>Contig51) 
45 GATCCAGTTACGCCCCGCCGCCTCGGTCACGCCGGGGTTTTCGGCGTCGA 

CCGGGGACGGTCGGGGCGACAACCGGGGGTTGTCGTCAGCGGTTCGCGTG 
30 GATC-TGCGCGACGAAGTTGTCCGCCGCCGAGTCGTCCTGCGTGCCGCCGC 
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GCTGCCGTACGAGCAGCCCGAGGCACCGCGCGAGCTCCTGCCGCTTCCGA 
7GCGCGGCGGCCTGCTCGGCCTCGTGCTCCGCGATTCGGCGCTGGCGCCT 
GCTCTCCTCGGCGCGTTCCTGCGTCCGCGCGAACAGCACCTCCTGCACCT 
CCGCGAGTTCCTCGGCGGTAGCGACCCGCACGTGATCCGCACGTCCTCCG 
5 AGACCGGTGACGACGTCAACCGCGGAGTAGTACGCCTCCGCGCGGACGAC 
15 GTCGTCGAGGTCGAGTTCAGCGTCGGCTCCGACGACCAGGCGCATCGCGT 

CGGAGGTGAGGACGACGCCACCGTCGAGCGGCGCGGCCTCCGGCTCGTAG 
ACGTACTGGAGGTCGGTCTCCTCGTCGTCGTCACCGTCGAGGTCGACGTG 
CCGCCGCAGGCCTCGGGTCCACTCGATCGCGCGTCGTCCGGCGAGGGCTT 
20 10 CCTCGTACTGGGCCCACCACGCGCGCAGCTGCTTCGGCGTGCCGTAGCCC 

TCGGCCATGTCGGGGTCGAGCCCGGCGACCTCGATGTCCCACAGTCGGTA 
GAGGATCTGGAACGGCGTCATG6ACTTCCGGCCCCGGCCGGTCTTGGAGT 
CCAGACGGGCGGTCTCCATCGCAGCTGCGCCGGCGGCTTCGAGGTCCTGG 

25 

TCGACGGAGTCGGGCCGCTCTCGCTTCCCGTCCTGGTTCTTGGTGAGGTA 
1 5 CTCGATCAGCGCGACGTCGTCAGCTGACCGGACGATCGAGACCATCACGC 
CGTGGCCCTTGCCCTTGCACTTGCAGCCGGGGGTGTCGCAGTCGGTCGAG 

30 GGCTCGAACTTGGGGTCAGCCCGCTTGAGGGCGCCCGCCCACATCTCCCG 

GAGCCAGTCCTCCCAGTCCCCCAGGTCCGTCTCGGAGGGCTCAAAGTGTC 
CGACGACGTCACCCTTGGCCGGGGTGCCAGAGAGCTCGCCGCCGAGGAAG 
2 0 ACCAGCAGGTTGAGGTGGGGGTGGTAACCGTTCTTCTTGGACCGGGTGAC 

35 CTCAGCCGCGCGGACCATGCCGATSTAGCCGATCCGGTGGCGGATGCCGT 

CCTCAGCGGGACGGACGTACTGCGTTCCGTCCTTCCGGGTGCGGCGGGCC 
TCAGGGCGGCCGTAGAAGGCCGGGGCCGTGAGCATCCGCTGGTAGGCACC 
GGGCGCGCGGCGGGGCTTGCCCGACCGGTCGAGGACCGGGGCGCCCTTGT 

40 

2 5 CGTCCAGGAGAGGCCCGCCCCAGAGCGCGGCGACCAGGCTGTCGAGGTCG 
GTGGTCTGGTTATGCCGGGCGGTGAGGACGACAACGGCGAGCGTGCCGCC 
GGCGGCGAGGTGCCGCAGAGCACCGGTCTTGATCTCCTCGGTCCGGCCAC 
45 GGCGGATCGCGGAGGAGCACTCCGGGCATAACCAGATCCGCCCGCAGCGG 

ACCAGGCCGATCGTGACGACGTACCCACGGCTCGACTTCGCGTAGATCAC 
30 GCCGGTGTCCGGGTCGAGGACCCGCCGCCCGCACCCGCCGCAGGCGTCGA 
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TCCCGGAGACCCGGTTGAGCACCTTGCGGCCCTGGTAGCGGCGTACGGCA 
GCGGTCGCCGCGCGCCTTGTGGTCTGTTCCGAAAGGGCTGCCGCCCTCTC 
GGACTCTCCCGTTCCTCCCACGACTGCCACTTCCGCAAAGTCGCTGGTCA 
GTGGGGGGTGGGAAAACTCTGTCAACCCTTTACCTAGGCGTCCCTTTTTG 
5 CCAGGGGCGGTCTCACGGGCGGCCTCGGCGGCTCGGTCGGCGGCCTTCCG 
GGCCCGCGCGGCCCGCTTCTTGCACGCCTCGGAGCAGAACTCCTTGGCTC 
GCTTGCCGGGGGTGATCGTGAGGGCGGCGCCGCAGCGGCAACGGGGGCCG 
GCGGGGACGCGGGCGGGCGACTGAGTCGGCGCGCCGATCAAAGAGGGGGT 
TGCGGACGCCAAAGCGTCCCTTACGCTGGACACAGACGAGTACCTTGGTT 
20 1G GGTAGCCGGGTGGACGTCAGAAGCGGTCAGGGATTAGGACCCCTGGCCGT 

TTCGCTTTTTCTGGAGTTGTTCGGGTAGATCCTGCCGCATCGCCCGCCTC 
ACGCGTGGCTCGCCGCGCGGATGCCTCAGAGGCCCCACCGGTCGTCAGGA 
CGCAGACGTCGGCGTGCTCCTGGTGGTGAGTCACCAGCTCGACCACACGG 
GCGCGGCCGGCGACGTCTC 



25 

15 



SEQ ID No 77 (>Contig52) 

30 CGGGATCTGGCCTTCATTAACCAACGACGGGGCAAACATAATAGGCTGGG 

CATTGCGCTTCAGCTCACCACAGCCCGTTTTCTGGGAACATTTCTGACGG 
ATTTAACTCAGGTTCTGCCTGGTGTTCAACATTTTGTCGCGGTACAGCTT 
2 0 AATATCCACCGTCCAGAAGTTCTCTCCCGCTATGCTGAACGGGACACTAC 

35 CCTTAGAGAACATACTGCATTAATTAAGGAATATTACGGCTATCATGAAT 

TTGGTGATTTTCCATGGTCTTTCCGCCTGAAGCGTCTGCTATATACCCGG 
GCGTGGCTCAGTAATGAGCGACCGGGTCTGATGTTTGATTTTGCCACTGC 
ATGGTTGCTTCAAAATAAGGTATTACTGCCCGGAGCAACCACACTAGTAC 

40 

2 5 GTCTCATCAGTGAAATTCGTGAAAGGGCAAATCAGCGGCTGTGGAAAAAG 
CTGGCCGCACTGCCGAACAAATGGCAGGCAGCTCAAGTGATGGAGCTTCT 
GGTCATTCCGGAAGGTCAGCGTGTATCAGCACTGGAACAGTXXXXXXXXX 
45 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGCTGGAACGATAT 

ATCCGATTACGAAGTCTTGAGTTTTCCCGACTGAACTTTTCCGGTCTGCC 
3C TGCCATTCAACTGCGTAATCTGGC.CGTTATGCTGGCATGGCGTCGGTAA 
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AATATATCGCTCGAATGCCACAGCAGAGAAAGCTTGCTGTACTTACTGCA 
TTCGTTAAAGCACAGGAAATAACGGCATTAGACGATGCCGTTGATGTGCT 

10 

TGATATGCTAATTCTGGACATTATCCGCGAAGCAAAGAAAACCGGGCAAA 
AAAAAAGACTCAGGACACTGAAAGATCTTGATCAGGCCGCATTGTTACTG 
5 GCGCGGGCATGTGCATTGTTGCTGGATGATAATACAGATGTCCCAGATCT 

15 CAGGCAGGTTATCTTCAAGTGCGTACCCAAAAACAGACTGGCAGAATCTG 

TAAGCAAGGTTAATGAACTTGCTCGTCCACAGAACAXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXAAACGTTTTCTTCCGGCGGTGTT 
GCGGGACCTGCATTTCCGTGCGGCACCGGCAGGTGAACATGTACTGGCTG 

20 : 0 CGATTCATTATCTGGCAGAACTGAATGGTTCGAAAAAGCGCATCCTTGAT 

GATGCGCCTGAACATATTATCACCGGTCCCTGGAAACGCCTCGTATACGA 
TGCGGAGGGACGGATACAGCGTGCAGGTTATTCACTATGTTTGCTGGAAC 
GCCTTCAGGATGCACTGCGCCGCCGGGACATCTGGCTTGAAAACAGTGAT 

25 

CGCTGGGGAGATCCTCGCGAGAAGTTGTTGCAAGGTGAAGAGTGGCAGAC 
1 5 TCAGCGTATTCCTGTCTGTCGGGCACTGGGACATCCTGTCGATGGACGTA 
AAGGTGTGCAACAACTGGCTATTCAGCTGGATGAGACCTGGAAAGCCGTG 
30 GCATCACGATTTGAAAAGAATGCGGAAGTTCATATCTGTAATGAAGGTAA 

ATATCCATCCCTGACTATCAGTTGTCTGGAGAAACAGGAAGAGCCACCAT 
CATTGCTTCGTCTAAATAATCGGATCAAACAGCTACTCCCACCGGTAGAT 

2 0 TTAACGGAACTGTTACTTGAGATAGATGCCCAGACAGGATTTACACATGA 

35 

GTTTGCGCATGTCAGAGAATCTGGTGCTCGAGCGCAAGATTTGCACATCA 
GTTTATGTGCGGTATGAATGGCTAAGCCCTGTAATATGGGCCTGAACCCG 
TTGATAAAGCACAATATACCAGCATTGACCCGCCATCGGCTCAGTTGGGT 
^ GAAACAGAATTACCTTCGTGCAGAAACGCTGGT 
25 

SEQ ID No 78 (>Contig53) 

ATTCCACGCGCTCACGGTCAGCTTCGACCCGCGCGAGCGCCCGGCGGCCG 
45 CCTCGCAGAAGCGCGCGGTCACGCTGTCCGAGCTCGGCGCGGACGCGCAG 

GCGCCGGAGTGGCCGTTCCTCGTCGGCGACGAGGCGGCGACCCGCGCGCT 

3 0 CGCCGAGGACCTCGGGTTCCGCTACGCCTACGATCCGACCACCGATCAGT 
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ACGCCCACCCGGCGGCCGTCTTCGTCCTGACGCCGGACGGGCGGATCTCC 
CGGTACCTGTACGGGACGGAGTTCCCGGCGCGCGATCTCCGGCTCGCGCT 
CCTGGAGGCGAGCCGCGGCGGTATCGGCACGATCGTCGATCGGGTGATCA 
TGACCTGCTATCGCTTCGACCCGGCGAGCCGGAGATACGCTCCGTTCCTA 
5 CTCGGCTTCCTCCGGCTCGGGGCGGCGGCCATCCTGATCACGGTCGGCGG 
15 GCTGCTCGCCGTCCTGTGGCGGCGCGAGCGCCGGCGGCCAGGTGCTCGCA 

CGAGCGCCGCCGTCGGTCGTGACGCCGTGGCCGACCGCCAGGGGAGGTCA 
CCATGATCAACGAGCTCCTGCGCAAGCTTCTTTTTCTGTCCGGCCAGTGG 
TCGACGATCGTGTTCGACATTTACAAGCTGCTTTACTTCGTGATCTCGGT 
20 1 0 GACGATGGCCGGCGCGACGCTCGTCGCCCTGTTCGCGGCCTACCTGATGA 

TCCGGTACCGCAGGCGCCAGCGGGATGTTGAAGGCCCGTTCCCCGGAGCG 
ACCGCGAGGCCTCCGCTCCTCCTCGAGGTCGGCATGGTGCTGGGCCTCAT 
CGTCCTGTTCCTCGTCTGGTGGGTCATTGGAATGCGGCAGTATGCAGAGC 

25 

TCCGCGTCGCCCCCGCGGACCCGGTCGTGGTGTACGTGACCGGGAAGCAG 
1 5 TGGATGTGGAAGTTCGCCTACCCGGAGGGCCCGAGCTCGGTGGCGACGCT 
CTATGTGCCGGCGCGTCGGCCGGTGAAGCTCGTCATGACGTCCCGGGACG 
30 TGATCCACAGCTTCTTCGTCCCCGATTTTCGCATCAAGTACGATGTCGTC 

CCCGGCCGCTACACCACGCTGTGGTTCGAGGCGACCGCGCCGGGCGCCTA 
TCAGATCCTGTGCACCGAGTACTGCGGGACGAACCACTCCACCATGCGCG 
20 GCGAGGTGATCGCGCTCGAGCCCTCCGATTTCGCGCGGTGGCTCTCCGAC 
35 CGCGGGCGGGGCGCCGGTATCGCCGGACAGGAGTACACGCCGCCGTCGAC 

GCCGGGCGAGGGGATCCCGCGCGAGCCGCTCAGCCTCGTCCGGCTGGGCG 
AGAACATCGCGGCCGAGGAGGGCTGCCTGCGCTGCCACACGCCGGACGGG 
ACACCGCACATCGGGCCGACCTGGGCCGGCCTCTACATGTCGGTCGTCCC 

40 

2 5 GCTGGAGAGCGGCGGCGCCGCGGTCGCCGACGACGCGTACATCACCGAGT 

. CGATGATGGATCCGCTCGCCCGGATCCACCGCGGCTACCAGCGGGTCATG 
CCCTCGTTCCTCGGCCGGCTCCAGCCGGCGCAGGTCGCCGCCATCGTCGA 
45 GTACATCCGGTCGTTGAGGGGCGTCGCGCCGGAGCCGGGCGCGCGGACGC 

w 

CGCTGCCCGAGGGCCCGCCCTTCCTGCGCTCCGGCCCGGAGCGCCCCGCC 

3 0 CCGCTCAGCGGGGGCGCGCCGGTCGGCCCGATCGAGGGCGGCAAGCCCGG 

50 



55 



1 



10 



15 



25 



WO 00/22139 PCT/US99/23535 

152 



GGAGGAGCTCCGATGAGCACGGAAC-CGTACGAATCTCTGCCCGACGCGCC 
GGCCGAGAGGCCCGAGCCCCGACTACCTCCATGTTTACCGCGGGGTGACG 
GAGTGGCTCACGACCACGGATCACAAGCGGATAGGTCTCATGTTCTACGC 
CGTCATCGTCGGGAAAGCTTCTTCCTCGGAGGCATATTCGCCCTCATCAT 
5 GCGGACCGAGCTCCTCACGCCCGAG CGGACCATCATCGACGCGGCGACCT 
ACAACCGGATGTTCACGCTGCACGGCGTGATCATGGTCTGGCTGTTCATG 
ATCCCGTCGATCCCCAACGCGTTCGGCAACTTCGTCCTGCCGATCATGCT 
CGGCGCCAAGGACCTCGCGTTCCCCCGGATCAACCTCGCGAGCTTCTACA 
TCTACCTCCTCGGGGCGGCGATCGZGATGGGCGGCATGATCGCGGGCGGC 
20 x o ACGGACACCGGCTGGACGTTCTAC CCGACGTACAGCCTGAAGACGCCGAT 

GACGCTGTTCCCGGTCGTCTTCGGCATCTTCATCGTCGGCGTCTCGTCCA 
TCATGACGGCGGTCAACTTCATCG73ACCACGCACACGATGCGCGCCGAG 
GGGCTCACGTGGAGCCGCCTGCCGCTCTTCGTCTGGAGCACCTACGCGAC 
GAGCATCATCCTGCTCTTCGCGAC Z CCGGTCCTCGGGCTCTCGATCCTGC 
1 5 ' TCATCGGCATCGACCACGTGACCGC3CTCGGGATGTTCGATCCCCGGTTC 
GGCGGCGATCCGGTCCTCTTCCAGCACCTCTTCTGGTTCTACTCCCACCC 
CGCCGTCTACATCATGATCCTGCCGGCGTTCGGCGTGGTGAGCGAGGTCG 
TCTGCACGTTCGCGCACAAGCGCCCCGCGTCCTACTGGGCGATCGCCATC 
TCGTCGCTCGGGATCGCGTTCGTGGGGTTCTGGACGTGGGGCCACCACAT 
20 GTTCGTGGCGGGGATGAGCGAGTACGCCGCGGACGTCTTCGGCGTGCTCT 
35 CGATGTTCGTGGCCATCTTCTCGG CCATC AAGGTCTACACGTGGGTCGCG 

ACGCTGTACAGGGGCTCGATCCACTTCAACACGCCGCTGCTCTACTTCAT 
CGCCTTCCTCTTCCTGTTCGTCTTCGGGGGGATGACGGGCGTGGCCGTCG 
CCACGCAGTCGCTGGACGTGCACTGGCACGACACATACTTCGTTGTGGCG 
2 5 CACTTCCACTTCATCATGGTGGGCGGGACGCTCACCATGTTCCTCGCGGC 
GGCGCACTACTGGTTTCCGAAGAT37TCGGGCGCCTCTACTCGGAGCGCG 
TCGGGCTCCTCTCGGCCGCGTCGG7GTTCCTCGGCTTCTTCTTGACCTTC 
45 TTCCCGCAGTTCCTCCTCGGGAACATGGGGATGCCCCGCCGCTATTACAG 

CTACCCGCCGCGCTACCAGTGGC7CCACGTGCTCTCGACCGGCGGCGCCT 
30 ACCTGCTCGCCGCGGCGCTCGTGATCTCGCTCCTGAACCTCGTCATCGCG 
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CTCAAGTGGGGCCGGAAGGCCGGGAGGAACCCCTGGGGCGGGCGCACGCT 
CGAGTGGATGACCGGCGAGCCCTTGCCGCCCAAGCACAACTTCCCGGTCG 
CGCCGCTCGTCCGCCGCGGCCCGTACGAGTTCCAGCTCTCCGAGGAGGAC 
GCCCGTGCGACAACCACGCCCGCTGCGTGAGCAQTTCGAAGATCTCGAGA 
5 AGCAGACGCACGCGGCCCGCCTCGGGATGTGGTTGTTCCTCGGGAGCGAG 
GTGCTCCTCTTCACCGGGCTCTTCGCGCTGTACGCGGCGTACCGCGAGCT 
CTACCCGCGCGATTTCGCCGAGGCCATCGCGCACAACAACGTCGCGATCG 
GCACCACCATGACGCTCATCCTGATCGGCAGCAGCTTCACCGTCGCCATG 
GCGGTGCACGCCGTCCGCGCCTCCCACCCGCGGCGCGCCGCGCTGTTCCT 
1 0 CGCGGTGAGCGTGGCGATCGGGATCGTGTTCCTCGTGCTGAAAGGGATCG 
AGTACGCGCAGCACTTCCGCGAGGGCATCTTCCCGGCCGGCGCCTACCGC 
TTCGCGGAGCTCCCGACGTTCGGCGCGCAGATGGCGTTCAC'GCTGTACTT 
CGCCATGACGGCGCTCCACGCCCTGCACGTCGTGGGCGGGGCCGGCCTCC 
TCACGGGGGTCGCGTGGGGGTGCTGGAAGGGCCGGTACTGGGCATACGAC 
1 5 CAGACGCCGGTGGAGCTCAGCGGCCTCTACTGGCACCTCGTGGACATCAT 
GTGGATCTTCATCTGGCCGCTCCTCTACCTGACGCGCAAGTGACGGCACA 
GCCCGGAGACGACCATGCCGCAAGAGCACGTCGCGGAAAGCACGCCCTGG 
ACCCGTTACCTCATGGACGCTGATGGCCCTCATTGCCCTCACGCTCCTGT 
CGTTAGCGCTCTCGTTCTTGCGCACGGGGGCTTGGGAAATACCGATCGCG 
2 0 CTGCTCCTCCCCGTGGTGAAGAGCGTGCTCGGGCTTGCTTTCTTCTTGCT 
CCGTCGAGGTGCATTAAGGTCATCAAACGCCTTTTTCAAACATTGGCTGC 
TGGTGTACTTACTTGGACGTTTCATACTGTCTCTTTTATGGCCCCCAACG 
GTCATTACGTCGACCTACACAATTTCTCTTCCCGCACCCTCCTACCTGTA 
TCTCTAAGCACTGCCTTGCGTCCTGCTCTATTACATTCTACTCCGGCTGT 

2 5 CCATGTGTGGGATTATATGCGCGAGGTACCTATTCCGCCGTGGAGTCTCC 

ATTTACCTCTTGGACCTTGCCCGTTCTGAATCTCGATCTCCTCATGCGTT 
GGTCCACCATGTATTACCTCCTAGAATCTTATACTCCATATCTCTATATA 
TCTAGTTGTGCGTGTAATTGTGTCATATATTATCGCCACTGCTGTATGAA 
TACCGTGCCGACGTGCTATATACGAAANTACTCCTCGGTCGATATCTCCA 

3 0 CeTCATATATACCTCCGAGTGTAGTATACGCACGAGTGTATATACTCTTC 



WO 00/22139 PCT/US99/23535 
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CTCTGGTCACGCGACTTCGTGCTGATATGATACCATCGTTCCATGTTACG 
CGAAGTTACTCATAAGATCTCCTCACACATCAACGAGTGTACTCCTATGT 

10 

GTTTCATACAAACTCGATACCCTTCAGAGTAGTGTCATGCCTATGTGGTA 
TGCATAATGTTAGTATACTTT 

5 

15 S3Q ID No 7 9 (>Contig54) 

TGGGAAAGAGGGCCACAGGGGATGTAGCAGGACGCTTAATAGTAAATGAC 
GAGGGTGTGCCGACGAGACCCGTAGGAAACAACGGGCACAGACGAGAGCA 
ATAAAGGGGGTTGGAAGGTAC CC CGG ATAGAGTAGAGAAGGCTAGCGGAC 

20 10 GAGTAAGACGCGGAGGAAATAAGTCGGCGTCGTAGAAGTTCTGTGGAGAA 

GGTACGACTCTTAAAGACCTAGGCGGGAGACAGTTTCCACCCGAGGCAGA 
GCAAGACCACAAGATTCAGAGGGAGTAAGGAGTTCCGAATTGGAGAGGTT 
GAGGGGCGTGTGAGCCGTCAAGTGGGGCGCGTACGCAAAGAAAGAAGCGT 

25 

CCATGTCAGAGGCCCAGCGGCCGTTGCGGCCCTACATGGGTGATCACGGT 
15 GGGCCGTAGGCGGACCGGAGATGAGCGCGGCTCCGCCCACCGGACGGCGA 
GGGGCACGGCGCCTTCGTCCGGGGCGCTCGCGCGCGATCTCGCGCGCGCG 
30 GGGGGCGCCGGCGTCGCCGTCGTCGGCGACGGCCAGCCGCCCATCGTCCA 

CGCCCTCGGGCACGTCATCAACGCCGCGCTCCGCAGCCGGGCGGCCTGGA 
TGGTCGATCCTGTGCTGATCGACGCGGGCCCCTCCACGCAGAGCTTCTCC 
20 GAGCTCGTCGGCGAGCTCGGGCGCGGCGCGGTCGACACCTTGATCCTCCT 
35 CGACGTGAACCCCGTGTACGCCGCGCCGGCCGACGTCGATTTCGCGGGCC 

TCCTCGCGCGCGTGCCCACGAGCTTGAAGGCCGGGCTCTACGACGACGAG 
ACCGCCCGCGCTTGCACGTGGTTCGTGCCGACCCGGCATTACCTCGAGTC 
GTGGGGGGACGCGCGGGCGTACGACGGGACGGTCTCGTTCGTGCAACCCC 

40 

25 TCGTCCGGCCGCTGTTCGACGGCCGGGCGGTGCCCGAGCTGCTCGCCGTC 
TTCGCGGGGGACGAGCGCCCGGATCCCCGGCTGCTGCTGCGCGAGCACTG 
GCGCGGCGAGCGCGGAGGGGCGGATTTCGAGGCCTTCTGGGGCGAGGCAT 
45 TGAAGCGCGGCTTCCTCCCTGACAGCGCCCGGCCGAGGCAGACACCGGAG 

CTCGCGCCGGCCGATCTCGCTCAGGAGCTCGCGCGGCTCGCCGCCGCGCC 
30 GCGGCCGGCCGGCGGCGCGCTCGACGTGGCGTTCCTCAGGTCGCCGTCGC 
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TCCACGACGGCAGGTTCGCCAACAACCCCTGGCTGCAAGAGCTCCCGCGG 
CCGATCACCAGGCTCACCTGGGGCAACGCCGCCATGATGAGCGCGGCGAC 
CGCGGCGCGGCTCGGCGTCGAGCGCGGCGATGTCGTCGAGCTCGCGCTGC 
GCGGCCGCACGATCGAGATCCCGGCCGTCGTCGTCCGCGGGCACGCCGAC 
5 GACGTGATCAGCGTCGACCTCGGCTATGGGCGCGACGCCGGCGAGGAGGT 
CGCGCGCGGGGTGGGCGTGTCGGCGTATCGGATCCGCCCGTCCGACGCGC 
GGTGGTTCGCGGGGGGCCTCTCCGTGAGGAAGACCGGCGCCACGGCCGCG 
CTCGCGCAGGCCCAGCTCGAGCTCTCCCAGCACGACCGTCCCATCGCGCT 
CAGGAGGACGCTGCCGCAGTACCGTGAACAGCCCGGTTTCGCGGAGGAGC 
20 10 ACAAGGGGCCGGTCCGCTCGATCCTGCCGGAGGTCCAGCACACCGGCGCG 

CAATGGGCGATGTCCATCGACATGTCGATCTGCACCGGGTGCTCCTCGTG 
CGTCGTGGCCTGTCAGGCCGAGAACAACGTCCTCGTCGTCGGCAAGGAGG 
AGGTGATGCACGGCCGCGAGATGCAGTGGTTGCGGATCGATCAGTACTTC 
GAGGGGGGAGGCGACGAGGTGAGCGTCGTCAACCAGCCGATGCTCTGCCA 
1 5 GCACTGCGAGAAGGCGCCGTGCGAGTACGTCTGTCCGGTGAACGCGACGG 
TCCACAGCCCCGACGGCCTCAACGAGATGATCTACAACCGATGCATCGGG 
30 ACGCGCTTTTGCTCCAACAACTGCCCGTACAAGATCCGGCGGTTCAATTT 

CTTCGACTACAATGCCCACGTCCCGTACAACGCCGGCCTCCGCAAGCTCC 
AGCGCAACCCGGACGTGACCGTCCGCGCCCGCGGCGTCATGGAGAAATGC 
20 ACGTACTGCGTGCAGCGGATCCGAGAGGCGGACATCCGCGCGCAGATCGA 
35 GCGGCGGCCGCTCCGGCCGGGCGAGGTGGTCACCGCCTGCCAGCAGGCCT 

GTCCGACCGGCGCGATCCAGTTCGGGTCGCTGGATCACGCGGATACCAAG 
ATGGTCGCGTGGCGCAGGGAGCCGCGCGCGTACGCCGTGCTCCACGACCT 
CGGCACCCGGCCGCGGACGGAGTACCTCGCCAAGATCGAGAACCCGAACC 

40 

2 5 CCGAGATTGAATGAGCCATGGCGGGCCCGCTCATCCTGGACGCACCGACC 

GACGATCAGCTGTCGAAGCAGCTCCTCGAGCCGGTATGGAAGCC3CGCTC 
CCGGCTCGGCTGGATGCTCGCGTTCGGGCTCGCGCTCGGCGGCACGGGCC 
45 TGCTCTTCCTCGCGATCACCTACACCGTCCTCACCGGGATCGGCGTGTGG 

GGCAACAACATCCCGGTCGCCTGGGCCTTCGCGATCACCAACTTCGTCTG 

3 0 GTGGATCGGGATCGGCCACGCCGGGACGTTCATCTCCGCGATCCTCCTCC 
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TGCTCGAGCAGAAGTGGCGGACGA3CATGAACCGCTTCGCCGAGGCGATG 
ACGCTCTTCGCGGTCGTCCAGGCCC-GCCTCTTTCCGGTCCTCCACCTCGG 
CCGCCCCTGGTTCGCCTACTGGATC TTCCCGTACCCCGCGACGATGCAGG 
TGTGGCCGGAGTTCCGGAGCGCGCTGCCGTGGGACGCCGCCGCGATCGCG 
5 ACCTACTTCACGGTGTCGCTCCTGTTCTGGTACATGGGCCTCGTCCCGGA 
15 TCTGGCGGCGCTGCGCGATCACGCCCCGGGCCGCGTCCGGCGGGTGATCT 

ACGGGCTCATGTCGTTCGGCTGGCACGGCGCCGCCGACCACTTCCGGCAT 
TACCGGGTGCTGTACGGGCTGCTCGCGGGGCTCGCGACGCCCCTCGTCGT 
CTCGGTGCACTCGATCGTGAGCAGCGATTTCGCGATCGCCCTGGTCCCCG 
20 io GCTGGCACTCGACGCTCTTTCCGCCGTTCTTCGTCGCGGGCGCGATCTTC 

' TCCGGGTTCGCGATGGTGCTCACGC7GCTCATCCCGGTGCGGCGGATCTA 
CGGGCTCCATAACGTCGTGACCGCGCGCCACCTCGACGATCTCGCGAAGA 
TGACGCTCGTGACCGGCTGGATCGf CATCCTCTCGTACATCATCGAGAAC 
TTCCTCGCCTGGTACAGCGGCTCGGCGTACGAAATGCATCAGTTTTTTCA 
1 5 GACACGCCTGCGCGGCCCGAACAACGCCGCCTACTGGGCCCAGCACGTCT 
GCAACGTGCTCGTCATCCAGCTCCTCTGGAGCGAGCGGATCCGGACGAGC 
30 CCCGTCGCGCTCTGGCTCATCTCCATCCTCGTCAACGTCGGGATGTGGAG 

CGAGCGGTTCACGCTCATCGTGATGTCGCTCGAGGAAGAGTTCCTCCCGT 
CCAAGTGGCACGGCTACAGCCCGACGTGGGTGGACTGGAGCCTCTTCATC 
2 0 GGGTCAGGCGGCTTCTTCATGCTCCTGTTCCTGAGCTTTTTGCGCGTCTT 
35 TCCGTTCATCCCCGTCGCGGAGGTCAAGGAGCTCAACCATGAAGAGCTGG 

AGAAGGCTCGGGGCAAGGGGGGGC3CTGATGGAGACCGGAACGCTCGGCG 
AGTTCGACGACCCGGAGGCGATGCTCCATGCGATCCGAGAGCTCAGGCGG 
CGCGGCTACCGCCGGGTGGAAGCGTTCACGCCCTATCCGGTGAAGGGGCT 
2 5 CGACGAGGCGCTCGACCTCCCGCG-TCGAACCTCAACCGGATGGTGCTGC 
CCTTCGCGATCCTGGGGGTCGTGG3CGGCTACTTCGTCCAGTGGTTCTGC 
AACGCTTTCCACTATCCGCTGAAC3TGGGCGGGCGCCCGCTGAACTCGGC 
45 GCCGGCGTTCATCCCGATCACGT7CGAGATGGGGGTGCTCTCCACCTCGA 

TCTTCGGCGTGCTCATCGGCTTTTACCTGACGAGGCTGCCGAGGCTCTAC 
30 CTCCCGCTCTTCGACGCCCCGGGCrTCGAGCGCGTCACGCTGGATCGGGT 
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CCTGGTCGGGCTCGACGACACGGAACCTTCCTTCTCGAGCGCCCAGGCGG 
AC-CGCGATCTCCTCGCGCTCGGCGCCAGGCGCGTCGTCGTGGCGAGGAGG 
CGCGAGGAGCCATGAGGGCCGGCGCCCCGGCTCGCCCCCTCGGGCGCGCG 
CTCGCGCCGTTCGCCCTCGTCCTGCTCGCCGGGTGCCGCGAGAAGGTGCT 
5 GCCCGAGCCGGACTTCGAGCGGATGATCCGCCAGGAGAAATACGGGCTCT 
GGGAGCCGTGCGAGCACTTCGACGACGGCCGCGCGATGCAGCACCCGCCC 
GAGGGGACCGTCGCGCGCGGGCGCGTCACCGGGCCGCCCGGCTATCTCCA 
GGGCGTCCTCGACGGGGCGTACGTCACGGAGGTGCCGCTCTCGCTCACGG 
TCGAGCTCGTGCAGCGCGGCCGGCAGCGCTTCGAGACCTTCTGCGCGCCG 
20 io TGCCACGGGATCCTCGGCGACGGCAGCTCGCGCGTGGCGACGAACATGAC 

GCTGCGCCCGCCGCCGTCGCTCGTCGGACCCGAGGCGCGGAGCTTCCCGC 
CGGGCAGGATCTACCAGGTCATCATCGAGGGCTACGGCCTGATGCCGCGC 
TACTCGGACGATCTGCCCGACATCGAAGAGCGCTGGGCCGTCGTCGCCTA 
CGTGAAGGCGCTTCAGCTGAGCCGCGGAGTGGCCGCGGGCGCCCTCCCGC 
1 5 CCGCGCTCCGCGGCCGGGCAGAGCAGGAGCTGCGATGAACAGGGATGCCA 
TCTAGTACAAGGGCGGCGCGACGATCGCGGCCTCGCTCGCGATCGCGGCG 
30 CTCGGCGCGGTCGCCGCGATCGTCGGCGGCTTCGTCGATCTCCGCCGGTT 

CTTCTTCTCGTACCTCGCCGCGTGGTCGTTCGCGGTATTCCTGTCCGTGG 
GCGCGCTCGTCACGCTCCTCACCTGCAACGCCATGCGCGCGGGCTGGCCC 
20 ACGGCGGTGCGCCGCCTCCTCGAGACGATGGTGGCGCCGCTGCCCCTGCT 
35 CGCGGCGCTCTCCGCGCCGATCCTGGTCGGCCTGGACACGCTGTACCCGT 

GGATGCACCCCGAGCGGATCGCCGGCGAGCACGCGCGGCGCATCCTCGAG 
CACAGAACGCCCTACTTCAATCCAGGCTTCTTCGTCGTGCGCTCGGCGAT 
CTACTTCGCGATCTGGATCGCCGTCGCCCTCGTGCTCCGCCGGCGATCGT 

40 

2 5 TCGCGCAGGACCGTGAGCCGAGGGCCGACGTCAAGGACGCGATGTATGGC 
CTGAGCGGCGCCATGCTGCCGGTCGTGGCGATCACGATCGTCTTCTCGTC 
GTTCGACTGGCTCATGTCCCTCGACGCGACCTGGTACTCGACGATGTTCC 
45 CGGTCTACGTGTTCGCGAGCGCCTTCGTGACCGCCGTCGGCGCGCTCACG 

GTCC-CTCGTATGCCGCGCAGACGTCCGGTTACCTCGCGAGGCTCAACGA 
30 CTCGCACTATTACGCGCTCGGGCGGCTGCTCCTCGCGTTCACGATATTCT 
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GGGCCTATGCGGCCTATTTCCAGTTCATGTTGATCTGGATCGCGAACAAG 
CCCGACGAGGTCGCCTTCTTCCTCGACCGCTGGGAAGGGCCCTGGCGGCC 
GACCTCCGTGCTCGTCGTCCTCACGCGGTTCGTCGTCCCGTTCCTGATCC 
TGATGTCGTACGCGATCAAGCGGCGCCCGCGCCAGCTCTCGTGGATGGCG 
5 CTCTGGGTCGTCGCCTCCGGCTACATCGACTTTCACTGGCTCGTGGTGCC 
GGCGACAGGGCGCCACGGGTTCGCCTATCACTGGCTCGACCTCGCGACCC 
TGTGCGTCGTGGGCGGCCTCTCGACCGCGTTCGCCGCGTGGCGGCTGCGA 
GGGCGGCCGGTGGTCCCGGTCCACGACCCGCGGCTCGAAGAGGCCTTTGC 
GTACCGGAGCATATGATGTTCCGTTTCCGTCACAGCGAGGTTCGCCAGGA 
20 10 GGAGGACACGCTCCCCTGGGGGCGCGTGATCCTCGCGTTCGCCGTCGTGC 

TCGCGATCGGCGGCGCGCTGACGCTCTGGGCCTGGCTCGCGATGCGGGCC 
CGCGAGGCGGATCTGCGGCCCTCCCTCGCGTTCCCCGAGAAGGATCTCGG 
GCCGCGGCGCGAGGTCAGCATGGTCCAGCAGTCGCTGTTCGACGAGGCGC 

25 

GCCTGGGCCAGCAGCTCGTCGAGGCGCAGCGCGCGGAGCTCCGCCGCTTC 
1 5 GGCGTCGTCGATCGGGAGAGGGGCATCGTGAGCATCCCGATCGACGACGC 
GATCGAGCTCATGGTGGCGGAGGGCGCGCGATGAGCCGGGCCGTCGCCGT 

30 GGCCCTCCTGCTGGCAGCCGGCCTCGTGTCGCGCCCGGGCGCCGCGTCCG 

AGCCCGTATCGCTTTCGCCCCGCGCTGGGCCCGTCCGCGGGCGAGGCCGC 
GCTCTGAAACGACGGCTCCGGCGCGGATGAGCGGCCCGAGGCGACCTCCT 
2 0 GC AACCCACCGCGCTGCGTACAGGGGTAAATCAACTGCATTCCAGGATAC 

35 GGACCGCGCAGCATAAACCCTCACCGGACAGACTGAATAATGCCGAACCT 

TGAACTTCTATGCCCATGCGTGCGGGATCATCATGCCCAACTATTTAAAC 
TGGTCCCTCCGGCAAAAGGAACGGACCAGCACCCAGAATAACCCCTGTTT 
GGCCAGCGCAAAAAAATGAAAACTCTTCTCCTTGCGGCTAAAATAAACGA 

40 

2 5 CTCCGGGGAACCGAATGGTATGCAATACACCATGGCAACGCGATTCGTCC 

TAACCTTAAACAAACTTCCCAGCGACTCGTCCGTCGAACGCTTCTGACGC 
AAGACCATGACCCCACGAGAACCGGGCGGCGGACACACTGCCAGTGAAAC 
45 TCGGCCTAGGCCCGCCCTGCCTTCACGTATTCACCGTGGGGCGGTCCAAA 

ACTAAAAACAATACGTGACTCTTCTCAAATATCGTCGGATAAAGGCCAAC 

3 0 ACGCGTACCTCCCCCTAAAGGGAAGAAACCCCTACCAGGGTGGACCGTAT 
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CCACCCGTGATCCCTGAAACATCTCCAGCCGCGTACAAATTAGGCTTTGA 
CAAAACC 



SEQ ID No 80 (>Contig55) 
5 GGGGAAAGAAGTCGGGAGCAGCAGAAGAGCGCAGCGATTGAAAGCACGAC 
GCAAGGGCAAAGGACATNAGACGACAACAGAAAGGCGAAGACATGAGAAG 
AGGAGAAAGACTCGGAAATGGGCACAAAGCGAGAAAAAACGTAACGCTAT 
CTGAGAAAGAACACCATAAAGTCAGACTGGGGTAAAGCCATACACGCAGC 
GAAAGCAGACAAGCAAAAGTCATTAACAGATGCAACAAAGAACAAAAGAG 
20 10 AAAGGAAGGACACATGAAGGAGAAAGGCGGCTTCGAGAACTAAAGACGGG 

AGCCAACAATATACCTTTCACAGGGGCGAAGAAGGGCCCAAGGTTCAATC 
GAAGACGATTGAATCCAAGGAAGTCCAATCGGAAGAAAAAAGCATATTGA 
TAAACCAGACAACGACAGCAGGCCTGAACGTAGGAGCGAGATCGTGAGAC 
ATCAGTAGGCAAAACAAGAGCGCTACACCCAGGGGTCGTCAACCTAGAAA 
1 5 GGCGCGTCCTCAAGCCGGTAGCGGCCGCGCGCGACCAGCCCGATGCGGGC 
GCCTTCGCGCGCGAACCGCCGCACCGTCGCGCGCCCCACGCCCGCTGACG 
CCCCCGTGATCACCACGACCTCCGGCCGCCTGGAATCGCCCATGCCCGTC 
GCCTCCCGCCTCGCCCGCGGCGTCAAGCAACGTGAATGCCACCTGAGCGT 
GTCACTTCCTCAAGCTCGACAGCACGTCCTTGATCCGCTCGGTCGGACCC 
2 0 GCCGTGAGCGACCAGGGGCTCGGCTGCCGCACGACGTGCAGGTCGCCTCG 
ACGCTCGAGCACCTCGAACCGCGTCGTTCCCTGCTCGGTCCGGACGAACC 
GGAGCGCCACCACGGCCGTGCCCACATGGAGGTTCGAGAGGGTGATCTCG 
GGCAGCCACGCCGGCAGCGCCGGGTCGACCAGCAGCAGGTCGAGCGGCGC 
GAACGGGTACAGCCCGAGCATCGCCTGGAGCATCGCGAACACGCTCGAGC 

2 5 AGGACCAGGCCTGCGGCCAGTTGGCCTTCGGGTACAGCGCGGGGAACGGG 
TGCTCCGCGTCGCGCGGGTGGCCGCTCCAGCACTCGGGCAGCCGGTGGTG 
CTCGAACAGCGCCGCCGCCTCGAACACGGCGCGGCACAGGAGCGCCACGT 

45 GCCCGTGCAGGCCGTACCGCGCGAGCCCGAGCGCGATCGCGCCCTGATCG 

ACGGGCCAGACCGTCCCCCGGTGATAGCTGTACGGATCGAACGCGGGGTG 

3 0 ATCGGCCGAGAGCGTGCGGATACCCCAGCCGGAGAACATGTCCGCCGCGA 
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ACATGCGGATGGCGGTGGGCTCGGCGAGGGCCTGGTCCACGATCCCCGCC 
GCGAGGCAGAGCCCCGGATCCGAGCCGATCGAGCGGATCTGGCGCTTGTC 
CGGGCCGAGGCCCATCGCGAAGGTGCGCGCGTCGGGCATCCAGAAGGCGT 
CGTTGAACCGCCTCTGCAGCTCGAGCGCCTCGGCGAACAGCCGTCGCGCG 
5 TCGTCCTTGCGGCCGAACCAGAAGAGCAGCTCGGAGAGGCGCAGCTTCGA 
CAGGAACACGAAGCCCTGCATCTCGCACGTCCCGATCGGCGGCCGCACCT 
GAGAGCCGTCGGCGTGGACGATGGCGTCGTCGGAGTCCTTCCAGCCCTGG 
TTCTGGATCGACGCGCTCGAGCGGGGCTCGTACTCGTAGAACCCGTCGCC 
GTCGAGATCGCCCTCCTCGTCGATCCAGCGCATGGCCCTGAGCGCAGGCT 
20 10 CGATCAGGCGGCCGACGCGCTCGCGATCGCCGGTCCAGTGCCAGAGCTCC 

GAGGCGGCCACCGCGTAGAACATCC-TCGAGGTCGCCGACGCGTACGTGCG 
CCCCAGCGGGTTGTAGTTGAGGTCCGAGAGCGCTCCGTCCCTGGCCTGAT 
GCAGCATCCGGTCGGGCTGCTCGTCGCGCCAGTCGTCGACGACGCGCCCC 

25 

TGCCAGCGCGGGAGCACGAGCGCTGTGCCTGCGAGGATGTCGGTCGTCAG 
15 CGCGGCGGCCTGCGTGCCCGCGGCGAGCGGGTCGCGGCCGAAGAGCCCGA 
TGTAGATCGGCAGTCCGGCGGCCACCGTCCAGGAGCGCTCGTCCTGGTCG 

30 ATATCGTACATGCGGAGCGCGATGAGATCGCGCTTGGCTCGCTCGAGCAC 

GGAGAAGACCGTATGCGAGAGCGTGTCGGCGCCCGGGACCGAGAAGGACG 
TGGCGCGGTGGTGGAAGGTGCCGCGCGCCACGTCGCGCGCGTTTCGCGTG 
2 0 CCGAAGAACGAGCGGCAGCCGGCGAGCAGCGGGAGCGGTTCGCCGCGGAT 

35 CAGGGCGATCACGTCGACGCAGCCGCGCCAGGTGCCATGGGGCTCGAGAT 

CGATCGAGAACCGGATCTCGCGGCCAGCGCAGGACGGGGGCGAGCCCGCG 
CTCCGGGCCCGTACGACGATCCCGGCGTCGAAGCGCGCGACGCCGGATTC 
TCCGGGGTGTTCATAGCGGTGCTCGGCGCGGTAGTCGCAGCGCAGCGCGC 

40 

2 5 ATCCGTGCTCGGCGGGCTCGAGCGCCCACCGGGCGTCGCCGCGCTGGAGG 
CGCGGGCCGTCCGTCTCCTCCGCGTCGGCGAAGTCGGCGTCGATCTCGAG 
CGCCAGCGTGAAGCGCACGCGCTCCTGCGTGAAGCTCGCGACGTCGATGT 
45 CCTCGTGGAAGCCGTCCCCCACGAAGCGCGACAGCCGCAGCTCGACGGTC 

CGCGGCGCCGCGCCGCCGCCCTCGGGCGGCGGGGCGATGTAGTAGCCGAG 
30 CCAGCTGTCCGGCTCGACGGCGGAGAGCGCGACGGGCGCCGGCGGCCTGC 
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CGTCGATCAGGTGGCGGTAGAGGGAGAGGAGGCGGGTGTTGCGCACGAAC 
AGGCCGATGTGCGGCTCGGGGGCGATCGAGCCGTCCGGACGCATGCACAG 
GACCGTCCGGTTCTGGCTGATGTACAGGGAGCCGGCGCGGGGCCTCAGCG 
TGGCCAGCGAGCCGAAGGGTCTCTCGAGCGACATGGTCACCTCCAGGCGA 
5 GCCCGGGGTGAGGACTGCCAGGGGCGTGCCAACGACGTGGACGCGCTCAC 
15 GCCAAGCCGCTGGGCGGGCGCGGCGGCCCCTGCGCCCGAGGCTCAGCCGA 

GCGCGAGCGGTGTTTCGTCGGCGCGAGCGGGCCTCGTCGTCGCCTCCAGG 
TAGGCCTTTCCGATGTCGGCCTCGTCCACGAACCCGACGATCGCGCCCTC 
GCCGTCGACGACGGGGACCTCGCGGACGCCGTGCGCCACCATCGCCTCGG 
20 io TCGCCGTCCGCAGATCGTCGGTGACCGTCACGGCCACCGGCGGCTGCATC 

' GCGTCGGCGGCCACGGTCATCCGCTCGAGGTCGTGCTCCACCGCGATGAT 
CCGGAGCGACTCGGCGGTGATCATGCCGACCATCTTGCGCGACGGTTCGA 
GCACCGGGAACACCTCCTGCCAGCTCGCGTCGGCCGCCCGCCGGAGCATC 
TCGCGGGCCGGCGTCCCCGGCACGAACGTCACGAGCGCGCGCCCCTCGAT 
15 CATGATCTGCCGCACGCGGATGGTCTTGAGCACGTCGAGCGTCGGCACCG 
GGTGCGCGGGAGACTCGCGCTGGGTGGGGAGCTGCGCGTGGTAGAGCGAG 
TGCTTCCGCAGCGCGACGAAGGCGACGCCCTCGGCGAGCATCAGCGGAAC 
CAGGAGGTCGTAGCTGCCGGCGAGCTCGCAGACCATCACGAGGGAGCTCA 
CCGGCACGTGCGCGACGCCGCCGTAGAAGGTGCCCATGCCCACGAGCGCG 
2 0 AAG3CGCCCGGGTCGATGCGCGGATCGCCGAGCAGGAGCGCCGCCGCGCG 
35 CCCGAACGCGCCGCCGAAGAGCCCGCCGATGACGAGCGACGGCGCGAAGT 

CGCCGGCGCACCCGCCGCTGCCGAGCGTGAGCGACGAGGCGACGATCTTG 
GCGGCGCAGAGCAGGAGCAGGAGCTCCACGCCGCGCCAGCCCGGGTGGAG 
CCACGTGGCGCCGGTGATCGCGACCTGGACGGCGCCGTACCCGCCGCCGA 

40 

25 GCAGCCCGAGCCCTTGCCCGGGGCTCTCGATCCTGCGGCCGACGAACCAG 
AGGACGGGCACGCAGAAGAGGCCCAGCGCGAGCCCGCCGAGCCCCGGGCG 
CGCCCAGGGGGCGATGGGCAGGCGCGCCGCGATCGCCTTCACGCCGCCGA 
45 GGCACTTCAGGAAGCCGATCGCGACGAAGGCCAGGAGCAGCGCGAGCAGC 

GCATAGAGAGGGAGGTGCGACGGGACGAACGCATACTTCGGCGCGTGCGC 
3 0 GAAGAGCGTCGACTCGCCGTAGAACGAGATGAAGACCGAGTAGGAGACCA 
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CGCTGGCGAGCAGCGCCGGGATCAGCGCCTCGGCCTCGAAGTCGTCGCGG 

TAGAGCACCTCGACGGCGAGCAGGGCGGCGCCGAGCGGCGTGCGGAAGAT 

GGCGGACATCCCGGCCGCGACCCCCGCGAGCATCAGGATGCGGTGCTCGC 

GCCGGCCGACCGCGAGCCCGCGCCCCACGAGCGAGCCGAGCGCTCCTCCG 

ACCTGCATGGTCGGCCCCTCGCGACCGCCGGCGCCGCCCGAGCCGAGCGT 

GAGGATCGACGCGACCGCCTTGACCCACGCGACCCGCTTGCGCATCCGGC 

CGCCGTGGTGGTGGAAGGCCTGGATCATCGCGTCGCCGCCGCCGCCCGCG 

GCCTCGGGGGCGAGGCGCCAGGTGAGGATGCCCCCGGCCAGCGCTCCGAG 

CGCCGGGATCAGCAGCAGCAGCCAGAGCCGGACGCTCCGGTGCTCGGGGC 

CGTCGCCGCTGAAGATGGCCTCGCCGTGGGCTCGAAGCCGCGCGTAGCCG 

GCGAGGCGGCCGAGGAGCAGCTCCTCGACGAGCTCGAGGGCGCCGAAGAA 

GAGCACCGCGACGAGGCCCGCGATGGCCCCGACGAGCACGGCATGCAGGA 

TCGTGCGCCCGACGAGCCTGAGATCGAGGGGCGCGACCTCCGAGAGGAGC 

GCGGAGAAGGGGCGCCGCCGTCGCACCACGCCGGCGCGCTCTGCTGGGAA 

TGGTTCCGTCACGGCGATGGT 
SEQ ID No 81 (>Contig56) 

GGATCCGGCCGCGAGCGGCTGCGGTGCCGATGCGCTCGACGGTGACGGGC 

GGGGTGATCGCGGGTCCGGAGCTCGGTGCGAGCTACTGGGCGGACAACCT 

TCGGCAGCCGGTGCGCTTCGCTGCGGCGGCGCAAGCGCTGCTGGAGGGTG 

GCCCCGCGCTGTTCATCGAGATGAGCCCGCACCCGATCCTGGTGCCGCCC 

CTGGACGAGATCCAGACGGCGGCCGAGCAAGGGGGCGCTGCGGTGGGCTC 

GCTGCGGCGAGGGCAGGACGAGCGCGCGACGCTGCTGGAGGCGCTGGGGA 

CGCTGTGGGCGTCCGGCTATCCGGTGAGCTGGGCTCGGCTGTTCCCCGCG 

GGCGGCAGGCGGGTTCCGCTGCCGACCTATCCCTGGCAGCACGAGCGGTG 

CTGGATCGAGGTCGAGCCTGACGCCCGCCGCCTCGCCGCAGCCGACCCCA 

CCAAGGACTGGTTCTACCGAACGGACTGGCCCGAGGTGCCCCGCGCCGCC 

CCGAAATCGGAGACAGCTCATGGGAGCTGGCTGCTGTTGGCCGACAGGGG 

TGGGGTCGGTGAGGCGGTCGCTGCAGCGCTGTCGACGCGCGGACTTTCCT 

GCACCGTGCTTCATGCGTCGGCTGACGCCTCCACCGTCGCCGAGCAGGTA 
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TCCGAAGCTGCCAGTCGCCGAAACGACTGGCAGGGAGTCCTCTACCTGTG 
GGGCCTCGACGCCGTCGTCGATGCTGGGGCATCGGCCGACGAAGTCAGCG 

W 

AGGCTACCCGCCGTGCCACCGCACCCGTCCTTGGGCTGGTTCGATTCCTG 
AGCGCTGCGCCCCATCCTCCTCGCTTCTGGGTGGTGACCCGCGGGGCATG 
5 CACGGTGGGCGGCGAGCCAGAGGCCTCTCTTTGCCAAGCGGCGTTGTGGG 
15 GCCTCGCGCGCGTCGCGGCGCTGGAGCACCCCGCTGCCTGGGGTGGCCTC 

GTGGACCTGGATCCTCAGAAGAGCCCGACGGAGATCGAGCCCCTGGTGGC 
CGAGCTGCTTTCGCCGGACGCCGAGGATCAACTGGCGTTCCGCAGCGGTC 
GCAGGCACGCAGCACGCCTTGTAGCCGCCGCGCCGGAGGGCGACGTCGCA 
*° ! o CCGATATCGCTGTCCGCGGAGGGGAGCTACCTGGTGACGGGCGGGCTGGG 

TGGCCTTGGTCTGCTCGTGGCTCGGTGGCTGGTGGAGCGGGGAGCTCGAC 
ATCTGGTGCTCACCAGCCGGCACGGGCTGCCAGAGCGACAGGCGTCGGGC 
GGAGAGCAGCCGCCGGAGGCCCGCGCGCGCATCGCAGCGGTCGAGGGGCT 
GGAAGCGCAGGGCGCGCGGGTGACCGTGGCAGCGGTGGATGTCGCCGAGG 
15 CCGATCCCATGACGGCGCTGCTGGCCGCCATCGAGCCCCCGTTGCGCGGG 
GTGGTGCACGCCGCCGGCGTCTTCCCCGTGCGTCACCTGGCGGAGACGGA 
30 cGAGGCCCTGCTGGAGTCGGTGCTCCGTCCCAAGGTGGCCGGGAGCTGGC 

TGCTGCACCX3GCTGCTGCGCGACCGGCCTCTCGACCTGTTCGTGCTGTTC 
TCGTCGGGCGCGGCGGTGTGGGGTGGCAAAGGCCAAGGCGCATACGCCGC 
20 GGCCAATGCGTTCCTCaACGGGCTCGOGCACCATCGCCGCGCGCACTCGC 
35 TGCCGGCGTTGAGCCTCGCCTGGGGCTTATGGGCCGAGGGAGGCATGGTT 

GATGCAAAGGCTCATGCACGTCTGAGCGACATCGGGGTCCTGCCCATGGC 
CACGGGGCCGGCCTTGTCGGCGCTGGAGCGCCTGGTGAACACCAGCGCTG 
TCCAGCGTTCGGTCACACGGATGGACTGGGCGCGCTTCGCGCCGGTCTAT 
2 5 GCCGCGCGAGGGCGGCGCAACTTGCTTTCGGCTCTGGTCGCGGAGGACGA 
GCGCGCTGCGTCTCCCCCGGTGCCGACGGCAAACCGGATCTGGCGCGGCC 
TGTCCGTTGCGGAGAGCCGCTCAGCCCTCTACGAGCTCGTTCGCGGCATC 
GTCGCCCGGGTGCTGGGCTTCTCCGACCCGGGCGCGCTCGACGTCGGCCG 
AGGCTTCGCCGAGCAGGGGCTCGACTCCCTGATGGCTCTGGAGATCCGTA 
ACCGCCTTCAGCGCGAGCTGGGCGAACGGCTGTCGGCGACTCTGGCCTTC 
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< GACCACCCGACGGTGGAGCGGCTG3TGGCGCATCTCCTCACCGACGTGCT 
GAAGCTGGAGGACCGGAGCGACACCCGGCACATCCGGTCGGTGGCGGCGG 
° atgacGACATCGCCATCGTCGGTGCCGCCTGCCGGTTCCCAGGTGGGGAT 

GAGGGCCTGGAGACATACTGGCGGCATCTGGCCGAGGGCATGGTGGTCAG 
5 CACCGAGGTGCCAGCCGACCGGTGGCGCGCGGCGGACTGGTACGACCCCG 
15 ATCCGGAGGTTCCGGGCCGGACCTATGTGGCCAAGGGTGCCTTCCTCCGC 

GATGTGCGCAGCTTGGATGCGGCGTTCTTCGCCATTTCCCCTCGTGAGGC 
GATGAGCCTGGACCCGCAACAGCG3CTGTTGCTGGAGGTGAGCTGGGAGG 
CGATCGAGCGCGCTGGCCAGGACCCGATGGCGCTGCGCGAGAGCGCCACG 
20 1 0 GGCGTGTTC GTGGGCATGATCGGGAGCGAGCACGCCGAGCGGGTGCAGGG 

CCTCGACGACGACGCGGCGTTGCTC-TACGGCACCACCGGCAACCTGCTCA 
GCGTCGCCGCTGGACGGCTGTCGTTCTTCCTGGGTCTGCACGGCCCGACG 
ATGACGGTGGACACCGCCTGCTCGTCGTCGCTGGTGGCGTTGCACCTCGC 
25 CTGCCAGAGCCTGCGATTGGGCGAGTGCGACCAGGCCCTGGCCGGCGGGT 
15 CCAGCGTGCTTTTGTCGCCGCGGTCATTCGTCGCGGCGTCGCGCATGCGT 
TTGCTTTCGCCAGATGGGCGGTGCAAGACGTTCTCGGCCGCTGCAGACGG 
30 CTTTGCGCGGGCCGAGGGCTGCGCCGTGGTGGTGCTCAAGCGGCTCCGTG 

ACGCGCAGCGCGACCGCGACCCCA7CCTGGCGGTGGTCAGGAGCACGGCG' 
ATCAACCACGATGGCCCGAGCAGCGGGCTCACGGTGCCCAGCGGTCCTGC 
20 CCAGCAGGCGTTGCTACGCCAGGCGCTGGCGCAAGCGGGCGTGGCGCCGG 
35 ccGAGGTCGATTTCGTGGAGTGCCACGGGACGGGGACAGCGCTGGGTGAC 

CCGATCGAGGTGCAGGCGCTGGGCGCGGTGTACGGGCGGGGCCGCCCCGC 
GGAGCGGCCGCTCTGGCTGGGCGCTGTCAAGGCCAACCTCGGCCACCTGG 
AGGCCGCGGCGGGCTTGGCCGGCGTGCTCAAGGTGCTCTTGGCGCTGGAG 

40 m 

2 5 CACGAGCAGATTCCGGCTCAACCG3AGCTCGACGAGCTCAACCCGCACAT 

CCCGTGGGCAGAGCTGCCAGTGGC CGTTGTCCGCAGGGCGGTCCCCTGGC 
CGCGCGGCGCGCGCCCGC3TCGTC-CAGGCGTGAGCGCTTTCGGCCTGAGC 
45 GGGACCAACGCGCATGTGGTGTTG3AGGAGGCGCCGGCGGTGGAGCCTGT 

GGCCGCGGCCCCCGAGCGCGCAGC3GAGCTGTTCGTCCTGTCGGCGAAGA 

3 0 GCC-CGGCGGCGCTGGATGCGCAG 3 CAGCCCGGCTGCGGGACCACCTGGAG 
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AAGCATGTCGAGCTTGGCCTCGGCGATGTGGCGTTCAGCCTGGCGACGAC 
GCGCAGCGCGATGGAGCACCGGCTGGCGGTGGCCGCGAGCTCGCGCGAGG 

10 

CGCTGCGAGGGGCGCTTTCGGCCGCAGCGCAGGGGCACACGCCGCCGGGA 
GCCGTGCGTGGGCGGGCCTCGGGCGGCAGCGCGCCGAAGGTGGTCTTCGT 
5 GTTTCCCGGCCAGGGCTCGCAGTGGGTGGGCATGGGCCGAAAGCTCATGG 

15 CCGAAGAGCCGGTCTTCCGGGCGGCGCTGGAGGGTTGCGACCGGGCCATC 

GAGGCGGAAGCGGGCTGGTCGCTGCTCGGGGAGCTCTCCGCCGACGAGGC 
CGCCTCGCAGCTCGGGCGCATCGACGTGGTTCAGCCGGTGCTGTTCGCCA 
TGGAAGTAGCGCTTTCTGCGCTGTGGCGGTCGTGGGGAGTGGAGCCGGAA 

20 :o GCGGTGGTGGGCCACAGCATGGGCGAGGTTGCGGCGGCGCACGTGGCCGG 

CGCGCTGTCGCTCGAGGACGCGGTGGCGATCATCTGCCGGCGCAGCCGGC 
TGCTGCGGCGGATCAGCGGTCAGGGGGAGATGGCGCTGGTCGAGCTGTCG 
CTGGAGGAGGCCGAGGCGGCGCTGCGTGGCCATGAGGGTCGGCTGAGCGT 

25 

GGCGGTGAGCAACAGCCCGCGCTCGACCGTGCTCGCCGGCGAGCCGGCGG 
1 5 CGCTCTCGGAGGTGCTGGCGGCGCTGACGGCCAAGGGGGTGTTCTGGCGG 
CAGGTGAAGGTGGACGTCGCCAGCCATAGCCCGCAGGTCGACCCGCTGCG 
30 CGAAGAGCTGATCGCGGCGCTGGGAGCGATCCGGCCGCGAGCGGCTGCGG 

TGCCGATGCGCTCGACGGTGACGGGCGGGGTGATCGCGGGTCCGGAGCTC 
GGTGCGAGCTACTGGGCGGACAACCTTCGGCAGCCGGTGCGCTTCGCTGC 
2 0 GGCGGCGCAAGCGCTGCTGGAGGGTGGCCCCGCGCTGTTCATCGAGATGA 
3 5 GCCCGCACCCGATCCTGGTGCCGCCCCTGGACGAGATCCAGACGGCGGCC 

GAGCAAGGGGGCGCTGCGGTGGGCTCGCTGCGGCGAGGGCAGGACGAGCG 
CGCGACGCTGCTGGAGGCGCTGGGGACGCTGTGGGCGTCCGGCTATCCGG 
TGAGCTGGGCTCGGCTGTTCCCCGCGGGCGGCAGGCGGGTTCCGCTGCCG 

40 

2 5 ACCTATCCCTGGCAGCACGAGCGGTACTGGATCGAGGACAGCGTGCATGG 
GTCGAAGCCCTCGCTGCGGCTTCGGCAGCTTCGCAACGGCGCCACGGACC 
ATCCGCTGCTCGGGGCTCCATTGCTCGTCTCGGCGCGACCCGGAGCTCAC 
45 TTGTGGGAGCAAGCGCTGAGCGACGAGAGGCTATCCTACCTTTCGGAACA 

TAGGGTCCATGGCGAAGCCGTGTTGCCCAGCGCGGCGTATGTAGAGATGG 
30 CGCTCGCCGCCGGCGTAGATCTCTATGGCACGGCGACGCTGGTGCTGGAG 
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CAGCTGGCGCTCGAGCGAGCCCTCGCCGTGCCCTCCGAAGGCGGACGCAT 
CGTGCAAGTGGCCCTCAGCGAAGAAGGTCCCGGTCGGGCCTCATTCCAGG 
TATCGAGTCGTGAGGAGGCAGGTAGGAGCTGGGTGCGGCACGCCACGGGG 
CACGTGTGTAGCGGCCAGAGCTCAGCGGTGGGAGCGTTGAAGGAAGCTCC 
5 GTGGGAGATTCAACGGCGATGTCCGAGCGTCCTGTCGTCGGAGGCGCTCT 
ATCCGCTGCTCAACGAGCACGCCCTCGACTATGGTCCCTGCTTCCAGGGC 
GTGGAGCAGGTGTGGCTCGGCACGGGGGAGGTGCTCGGCCGGGTACGCTT 
GCCAGGAGACATGGCATCCTCAAGTGGCGCCTACCGGATTCATCCCGCCT 
TGTTGGATGCATGTTTTCAGGTGCTGACAGCGCTGCTCACCACGCCGGAA 
20 io TCCATCGAGATTCGGAGGCGGCTGACGGATCTCCACGAACCGGATCTCCC 

• GCGGTCCAGGGCTCCGGTGAATCAAGCGGTGAGTGACACCTGGCTGTGGG 
ACGCCGCGCTGGACGGTGGACGGCGCCAGAGCGCGAGCGTGCCCGTCGAC 
CTGGTGCTCGGCAGCTTCCATGCGAAGTGGGAGGTCATGGAGCGCCTCGC 
GCAGGCGTACATCATCGGCACTCTCCGCATATGGAACGTCTTCTGCGCTG 
1 5 CTGGAGAGCGTCACACGATAGACGAGTTGCTCGTCAGGCTTCAAATCTCT 
GTCGTCTACAGGAAGGTCATCAAGCGATGGATGGAACACCTTGTCGCGAT 
CGGCATCCTTGTAGGGGACGGAGAGCATTTTGTGAGCTCTCAGCCGCTGC 
CGGAGCCTGATTTGGCGGCGGTGCTCGAGGAGGCCGGGAGGGTGTTCGCC 
GACCTCCCAGTCCTATTTGAGTGG7GCAAGTTTGCCGGGGAACGGCTCGC 
20 GGACGTATTGACCGGTAAGACGCTCGCGCTCGAGATCCTCTTCCCTGGTG 
35 GCTCGTTCGATATGGCGGAGCGAATCTATCGAGATTCGCCCATCGCCCGT 

TACTCGAACGGCATCGTGCGCGGTGTCGTCGAGTCGGCGGCGCGGGTGGT 
AGCACCGTCGGGAATGTTCAGCATCTTGGAGATCGGAGCAGGGACGGGCG 
CGACCACCGCCGCCGTCCTCCCGGTGTTGCTGCCTGACCGGACGGAGTAC 

40 

2 S CATTTCACCGATGTTTCTCCGCTCTTCCTTGCTCGCGCGGAGCAAAGATT 
TCGAGATTATCCATTCCTGAAGTATGGCATTCTGGATGTCGACCAGGAGC 
CAGCTGGCCAGGGATACGCACATCAGAGGTTTGACGTCATCGTCGCGGCC 
45 AATGTCATCCATGCGACCCGCGATATAAGAGCCACGGCGAAGCGTCTCCT 

GTCGTTGCTCGCGCCCGGAGGCCTTCTGGTGCTGGTCGAGGGCACAGGGC 
30 ATCCGATCTGGTTCGATATCACCACGGGATTGATTGAGGGGTGGCAGAAG 
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TACGAAGATGATCTTCGTATCGACCATCCGCTCCTGCCTGCTCGGACCTG 
GTGTGACGTCCTGCGCCGGGTAGGCTTTGCGGACGCCGTGAGTCTGCCAG 
GCGACGGATCTCCGGCGGGGATCCTCGGACAGCACGTGATCCTCTCGCGC 
GCGCCGGGCATAGCAGGAGCCGCTTGTGACAGCTCCGGTGAGTCGGCGAC 
5 CGAATCGCCGGCCGCGCGTGCAGTACGGCAGGAATGGGCCGATGGCTCCG 
CTGACGTCGTCCATCGGATGGCGTTGGAGAGGATGTACTTCCACCGCCGG 
CCGGGCCGGCAGGTTTGGGTCCACGGTCGATTGCGTACCGGTGGAGGCGC 
GTTCACGAAGGCGCTCGCTGGAGATCTGCTCCTGTTCGAAGACACCGGGC 
AGGTCGTGGCAGAGGTTCAGGGGCrCCGCCTGCCGCAGCTCGAGGCTTCT 
20 i o GCTTTCGCGCCGCGGGACCCGCGGGAAGAGTGGTTGTACGCTTTGGAATG 

GCAGCGCAAAGACCCTATACCAGAGGCTCCGGCAGCCGCGTCTTCTTCCT 
CCGCGGGGGCTTGGCTCGTGCTGATGGACCAGGGCGGGACAGGCGCTGCG 
CTCGTATCGCTGCTGGAAGGGCGAGGCGAGGCGTGCGTGCGCGTCATCGC 
GGGTACGGCATACGCCTGCCTCGCGCCGGGGCTGTATCAAGTCGATCCGG 
1 5 CGCAGCCAGATGGCTTTCATACCCTGCTCCGCGATGCATTCGGCGAGGAC 
CGGATTTGTCGCGCGGTAGTGCATATGTGGAGCCTTGATGCGACGGCAGC 
AGGGGAGAGGGCGACAGCGGAGTC G CTTCAGGCCGATCAACTCCTGGGGA 
GCCTGAGCGCGCTTTCTCTGGTGCAGGCGCTGGTGCGCCGGAGGTGGCGC 
AACATGCCGCGGCTTTGGCTCTTGACCCGCGCCGTGCATGCGGTGGGCGC 
20 GGAGGACGCAGCGGCCTCGGTGGCC-CAGGCGCCGGTGTGGGGCCTCGGTC 
35 GGACGCTCGCGCTCGAGCATCCAGAGCTGCGGTGCACGCTCGTGGACGTG 

. AACCCGGCGCCGTCTCCAGAGGACGCAGCCGCACTGGCGGTGGAGCTCGG 
GGCGAGCGACAGAGAGGACCAGGTCGCATTGCGCTCGGATGGCCGCTACG 
TGGCGCGCCTCGTGCGGAGCTCCTTTTCCGGCAAGCCTGCTACGGATTGC 

2 5 GGCATCCGGGCGGACGGCAGCTATGTGATCACCGATGGCATGGGGAGAGT 
GGGGCTCTCGGTCGCGCAATGGATGGTGATGCAGGGGGCCCGCCATGTGG 
TGCTCGTGGATCGCGGCGGCGCTTCCGAGGCATCCCGGGATGCCCTCCGG 

45 TCCATGGCCGAGGCTGGCGCGGAGGTGCAGATCGTGGAGGCCGACGTGGC 

TCGGCGCGACGATGTCGCTCGGCTCCTCTCGAAGATCGAACCGTCGATGC 

3 0 CGCCGCTTCGGGGGATCGTGTACC-rGGACGGGACCTTCCAGGGCGACTCC 
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TCGATGCTGGAGCTGGATGCCCGTCGCTTCAAGGAGTGGATGTATCCCAA 
GGTGCTCGGAGCGTGGAACCTGCACGCGCTGACCAGGGATAGATCGCTGG 
ACTTCTTCGTCCTGTATTCCTCGGGCACCTCGCTTCTGGGCTTGCCAGGA 
CAGGGGAGCCGCGCCGCCGGTGACGCCTTCTTGGACGCCATCGCGCATCA 
5 CCGGTGCAAGGTGGGCCTTACAGCGATGAGCATCAACTGGGGATTGCTCT 
CCGAAGCATCATCGCCGGCGACCCCGAACGACGGCGGAGCACGGCTCGAA 
TACCGGGGGATGGAAGGCCTCACGCTGGAGCAGGGAGCGGCGGCGCTCGG 
GCGCTTGCTCGCACGACCCAGGGCGCAGGTAGGGGTGATGCGGCTGAATC 
TGCGCCAGTGGTTGGAGTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
20 1 0 XXXXTATGGTATAACTTATTGATTATAATACAGTATACAAAGGTCCCTTT 

TCAGGGACCCTTTCGTATGTTGTAC-CTGATTTTATTTTCTTCTTTTCTTT 
TGGGTGCTTTTAATAGCCATATAiGATGTTTCATATAAAGTTAAAAGTTT 
CCCCATCTGTCTATCTATAGCGTCATGTTTTTCAGGATTTCTTAATTTCT 

25 

GCAGCAGTTTAGATATATATTTCGCAGGTAACTTTATTATTAATGAATTA 
1 5 TGGGATTGATACATTTCCTTTGTCCAATCTAAATCTTTTCCTGATTTCAA 
TACCTCACTCTCATCTCTATTAATTATAAAACCGAGCTTTCCATACGGAC 
30 CTGTTAAATATGATTGTAATTGTCTATACTCAGATGGACCTAATTCTTCA 

AAATTTTTAGCATCAAAAACAACTTGTCTTGTTTTATAGTCCTCCAATAC 
TCGTTTCCAAAAATCAGATTTGCCACCATTGGTGCCTATAATATCTCGTC 
2 0 TCTGAACTGCGTTACCATTTGGATGGGACTTGATGTCTGTTAGGTGGGAT 
GCAAATACTATTCTTAGTGCGTCTAAACACCATTGTTCAAATTCAGTGGC 
ACCTTCATTTCCTATCGGTATCTGATCTAGATGAGTAGTTATTTGACCTA 
TTGTTTTATTTCTAATGGCTGAATTATCTGAAATAATATTAATATCATAT 
TCGTCATTTATCTCCTCTGCTTCCTCTGGAGCAAGTGCATTACGATTTAG 

2 5 ATTCAAACCAAGCCAGTAACATGGATGAATTAATAATTTCTCATTACTTT 
CAAAACCTTTATCTGGAGTTCGCCCGTCATGGCAAAATGAATAGGATGAG 
GTGTTTTTATCACGAATACCAACAAATCCTACACTATACAAACTTTGGAG 

45 AATTCCACTTGCCTTTAACAACTGTATTTCAGACGTTATTTTAGGATCTC 

CATTTTCTTCGATTAATTCGAAAGATGCTTCTATTTTTTTTAAGCACGTA 

3 0 TAAACTGTTAATTCAGGTTCAATGCTACGAAATGCACTAGTTATAACCTG 
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TATTGAAGGAAAGATCTTCTGATACTCTTTCCAGAGATCTTCAAGTCTGG 
CCATGGAAATTGACTTGGCTGCATATTCTAGGTCAGTGTTTATGATAGTT 
TCTCTATTCTCTCTGAATGCGGAAAAAAAAGCTTCATTCAACAATGATAG 
TAAATC CCTGGGCCGGTAAAGGGTAAATTGCAAACATCGCTTAAAACCAT 
5 TCCTCCCTTTAAGATCATCCGCTGTGCATCTATCCCAAACTCGTTGATCT 
TTCTCAATATCTAGCTTAAATGCTACTTTCATTCTTTTAGCTGACAGCAT 
TAGGAGTTGTGCCCAGTCCCAATGCAACCTTATGACTTGACCCTCTATAT 
TTCTCGAGTAATCAGGATCTTCCTTTGATAGCGACCTAAATATATTATCC 
CTTAAAAAAATTATTGGACGAATGCATTTTGCTTTTTGATTTAATTCAAT 
20 io AGATGCATATGCTAGACCTGCAATGATTCCAATTCCTATATTATCCGGTT 

CATACGCCTCATCTAGCTTATCCATTAATATGACAACTTTCCTGTCTGAG 
CGTTCAAGAAGTGATACTATATTATTTTCTATTTCTGAGATATTCAAATT 
GAATTGAAGATCACCAATTGATTCTTCTGGGTTATTTTCATCTAAATACT 

25 

CCTTTGCGACAAGCCTACACTTTCTTAAAATGTCACCTTGTGCAGAATTC 
1 5 CATTTTTTCAAATGTTCATTCAACAATGTTTCTGATGATATTTGAGATGA 
CAATTTGTAATGAGATGATATATATGATGCTATCTCCATTAGCATAGCGT 
30 ATCGCCATAATAGTCTTGTTGCTGCCCTTGCTAAATTAAATGATCCTGTA 

AATGGTTTCAACATTGATCTGAAACCAATAATTTGAGAATCGTCTGGTGA 
GAAACTCAGGATTAATATTTTTTTGTCTTTCTTCCAATGCTCATTTAGCT 
2 0 GAATAAATAAAGCACTTTTACCTGTCCCTCGTCTACCAACAACAATGGTC 
35 CTGTCATCAGTTTCAATTAGAGTCCTAAAGTCAGCAGTTTCAATGAAAGC 

ATTACTCAACATCTTTTTATCATTTTCTGCTGTCGTATCACCAAACGGAT 
TAGACTTTGAAGTAATATTCAATTCCATATTCAACCTTTTATGTTAGTTG 
CTTTCATTTATTACTTTATATACTGTTGAACGAGCAATATTCATTGTTTT 

40 

2 5 TGATATATGTGAGGCACCTAACCCCTGTTGCCACATATTTAATACTGCAT 

CTCTATCTATTTTTCTTTTTCTACCAAAAACAACTCCTTTTGCCA 

45 SEQ ID NO 82 (>Contig57) 

TCATCTATTGTATAGTTTGTATATTGATATGATATAATTATAACATATAA 

3 0 ACAGTAAACTTTCTCTACGTAGATCGAGGAGAAGACTCAATTTGTTGACA 
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TCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACTACTCGCATACCGTTG 

CGCAACAGCGGCGCGAGGAGCAGGACGCATACGACATCACCGGCAATACG 

CTCAGCGTCGCCGACGGACGGTTGTCTTATACGCTAGGGCTGCAGGGACC 

CTGCCTGACCGTCGACACGGTCTGCTCGTCGTCGCTCGTGGCCATCCACC 

TTGCCTGCCGCAGCCTGCGCGCTCGCGAGAGCGATCTCGCGCTGGCGGGA 

GGCGTCAACATGCTCCTTTCGTCCAAGACGATGATAATGCTGGGGCGCAT 

CCAGGCGCTGTCGCCCGATGGCCACTGCCGGACATTCGACGCCTCGGCCA 

ACGGGTTCGTCCGTGGGGAGGGCTGCGGTATGGTCGTGCTCAAACGGCTC 

TCCGACGCCCAGCGACACGGCGATCGGATCTGGGCTCTGATCCGGGGTTC 

GGCCATGAATCAGGATGGCCGGTCGACAGGGTTGATGGCACCCAATGTGC 

TCGCTCAGGAGGCGCTCTTGCGCGAGGCGCTGCAGAGCGCTCGCGTCGAC 

GCCGGGGCCATCGGTTATGTCGAGACCGACGGAACGGGGACCTCGCTCGG 

CGACCCGATCGAGGTCGAGGCGCTGCGTGCCGTGTTGGGGCCGGCGCGGG 

CCGATGGGAGCCGCTGCGTGCTGGGCGCAGTGAAGACAAACCTCGGCCAC 

CTGGAGGGCGCTGCAGGCGTGGCGGGTTTGATCAAGGCGGCGCTGGCTCT 

GCACCACGAACTGATCCCGCGAAACCTCCATTTCCACACGCTCAATCCGC 

GGATCCGGATCGAGGGGACCGCGCTCGCGCTGGCGACGGAGCCGGTGCCG 

TGGCCGCGGGCGGGCCGACCGCGCTTCGCGGGGGTGAGCGCGTTCGGCCT 

CAGCGGCACCAACGTCCATGTCGTGCTGGAGGAGGCGCCGGCCACGGTGC 

TCGCACCGGCGACGCCGGGGCGCTCAGCGGAGCTTTTGGTGCTGTCGGCG 

AAGAGCGCCGCCGCGCTGGACGCACAGGCGGCGCGGCTCTCAGCGCACAT 

CGCCGCGTACCCGGAGCAGGGTCTCGGAGACGTCGCGTTCAGCCTGGTAT 

CGACGCGTAGCCCGATGGAGCACCGGCTCGCGGTGGCGGCGACCTCGCGC 

GAGGCGCTGCGAAGCGCGCTGGAGGTTGCGGCGCAGGGGCAGACCCCGGC 

AGGCGCGGCGCGCGGCAGGGCCGCTTCCTCGCCCGGCAAGCTCGCCTTCC 

TGTTCGCCGGGCAGGGCGCGCAGGTGCCGGGCATGGGCCGTGGGTTGTGG 

GAGGCGTGGCCGGCGTTCCGCGAGACCTTCGACCGGTGCGTCACGCTCTT 

CGACCGGGAGCTCCATCAGCCGCTCTGCGAGGTGATGTGGGCCGAGCCGG 

GCAGCAGCAGGTCGTCGTTGCTGGACCAGACGGCGTTCACCCAGCCGGCG 

CTCTTTGCGCTGGAGTACGCGCTGGCCGCGCTCTTCCGGTCGTGGGGCGT 
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GGAGCCGGAGCTCGTCGCTGGCCATAGCCTCGGCGAGCTGGTGGCCGCCT 
GCGTGGCGGGTGTGTTCTCCCTCGAGGACGCCGTGCGCTTGGTGGTCGCG 
CGCGGCCGGTTGATGCAGGCGCTGCCGGCCGGCGGCGCGATGGTATCGAT 
CGCCGCGCCGGAGGCCGACGTGGCTGCCGCGGTGGCGCCGCACGCAGCGT 
5 TGGTGTCGATCGCGGCAGTCAATGGGCCGGAGCAGGTGGTGATCGCGGGC 
75 GCCGAGAAATTCGTGCAGCAGATC GCGGCGGCGTTCGCGGCGCGGGGGGC 

GCGAACCAAACCGCTGCATGTCTCGCACGCGTTCCACTCGCCGCTCATGG 
ATCCGATGCTGGAGGCGTTCCGGCGGGTGACTGAGTCGGTGACGTACCGG 
CGGCCTTCGATCGCGCTGGTGAGCAACCTGAGCGGGAAGCCCTGCACCGA 
20 10 TGAGGTGAGCGCGCCGGGTTACTG3GTGCGTCACGCGCGAGAGGCGGTGC 

' GCTTCGCGGACGGAGTGAAGGCGC~GCACGCGGCCGGTGCGGGCCTCTTC 
GTCGAGGTGGGGCCGAAGCCGACGCTGCTCGGCCTTGTGCCGGCCTGCCT 
GCCGGATGCCAGGCCGGTGCTGCTCCCAGCGTCGCGCGCCGGGCGTGACG 

25 

AGGCTGCGAGCGCGCTAGAGGCGCTGGGTGGGTTCTGGGTCGTCGGTGGA 
15 TCGGTCACCTGGTCGGGTGTCTTCCCTTCGGGCGGACGGCGGGTACCGCT 
GCCAACCTATCCCTGGCAGCGCGAGCGTTACTGGATCGAAGCGCCGGTCG 

30 ATCGTGAGGCGGACGGCACCGGC CGTGCTCGGGCGGGGGGCCACCCCCTT 

CTGGGTGAAGTCTTTTCCGTGTCGACCCATGCCGGTCTGCGCCTGTGGGA 
GACGACGCTGGACCGAAAGCGGCTGCCGTGGCTCGGCGAGCACCGGGCGC 
2 0 AGGGGGAGGTCGTGTTTCCTGGCGCCGGGTACCTGGAGATGGCGCTGTCG 

35 TCGGGGGCCGAGATCTTGGGCGATGGACCGATCCAGGTCACGGATGTGGT 

GCTCATCGAGACGCTGACCTTCGCGGGCGATACGGCGGTACCGGTCCAGG 
TGGTGACGACCGAGGAGCGACCGGGACGGCTGCGGTTCCAGGTAGCGAGT 
CGGGAGCCGGGGGAACGTCGCGCGCCCTTCCGGATCCACGCCCGCGGCGT 

40 

2 5 GCTGCGGCGGATCGGGCGCGTCGAGACCCCGGCGAGGTCGAACCTCGCCG 

CCCTGCGCGCCCGGCTTCATGCCGCCGTGCCCGCTGCGGCTATCTATGGT 
GCGCTCGCCGAGATGGGGCTTCAATACGGCCCGGCGTTGCGGGGGCTCGC 
45 CGAGCTGTGGCGGGGTGAGGGCGAGGCGCTGGGCAGGGTGAGACTGCCTG 

AGGCCGCCGGCTCCGCGACAGCCTACCAGCTGCATCCGGTGCTGCTGGAC 

3 0 GCGTGCGTCCAAATGATTGTTGGCGCGTTCGCCGATCGCGATGAGGCGAC 
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GCCGTGGGCGCCGGTGGAGGTGGGCTCGGTGCGGCTGTTCCAGCGGTCTC 
CTGGGGAGCTATGGTGCCATGCGCGCGTCGTGAGCGATGGTCAACAGGCC 
TCCAGCCGGTGGAGCGCCGACTTTGAGTTGATGGACGGTACGGGCGCGGT 
GGTCGCCGAGATCTCCCGGCTGGTGGTGGAGCGGCTTGCGAGCGGTGTAC 
5 GCCGGCGCGACGCAGACGACTGGTTCCTGGAGCTGGATTGGGAGCCCGCG 
15 GCGCTCGGTGGGCCCAAGATCACAGCCGGCCGGTGGCTGCTGCTCGGCGA 

GGGTGGTGGGCTCGGGCGCTCGTTGTGCTCGGCGCTGAAGGCCGCCGGCC 
ATGTCGTCGTCCACGCCGCGGGGGACGACACGAGCACTGCAGGAATGCGC 
GCGCTCCTGGCCAACGCGTTCGACGGQCAGGCCCCGACGGCCGTGGTGCA 
20 10 CCTCAGCAGCCTCGACGGGGGCGGCCAGCTCGGCCCGGGGCTCGGGGCGC 

AGGGCGCGCTCGACGCGCCCCGGAGCCCAGATGTCGATGCCGATGCCCTC 
GAATCGGCGCTGATGCGTGGTTGCGACAGCGTGCTCTCCCTGGTGCAAGC 
GCTGGTCGGCATGGACCTCCGAAACGCGCCGCGGCTGTGGCTCTTGACCC 

25 

GCGGGGCTCAGGCGGCCGCCGCCGGCGATGTCTCCGTGGTGCAAGCGCCG 
15 CTGTTGGGGCTGGGCCGCACCATCGCCTTGGAGCACGCCGAGCTGCGCTG 
TATCAGCGTCGACCTCGATCCAGCCGAGCCTGAAGGGGAAGCCGATGCTT 
30 TGCTGGCCGAGCTACTTGCAGATGATGCCGAGGAGGAGGTCGCGCTGCGC 

GGTGGCGACCGGCTCGTTGCGCGGCTCGTCCACCGGCTGCCCGACGCTCA 
GCGCCGGGAGAAGGTCGAGCCCGCCGGTGACAGGCCGTTCCGGCTAGAGA 
2 0 TCGATGAACCCGGCGCGCTGGACCAACTGGTGCTCCGAGCCACGGGGCGG 
CGCGCTCCTGGTCCGGGCGAGGTCGAGATCTCCGTCGAAGCGGCGGGGCT 
CGACTCCATCGACATCCAGCTGGCGTTGGGCGTTGCTCCCAATGATCTGC 
CTGGAGAAGAAATCGAGCCGTTGGTGCTCGGAAGCGAGTGCGCCGGGCGC 
ATCGTCGCTGTGGGCGAGGGCGTGAACGGCCTTGTGGTGGGCCAGCCGGT 
25 GATCGCCCTTGCGGCGGGAGTATTTGCTACCCATGTCACCACGTCGGCCA 
CGCTGGTGTTGCCTCGGCCTCTGGGGCTCTCGGCGACCGAGGCGGCCGCG 
ATGCCCCTCGCGTATTTGACGGCCTGGTACGCCCTCGACAAGGTCGCCCA 
45 CCT3CAGGCGGGGGAGCGGGTGCTGATCCATGCGGAGGCCGGTGGTGTCG 

GTCTTTGCGCGGTGCGATGGGCGCAGCGCGTGGGCGCCGAGGTGTATGCG 
30 ACCGCCGACACGCCCGAGAACCGTGCCTACCTGGAGTCGCTGGGCGTGCG 
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GTACGTGAGCGATTCCCGCTCGGGCCGGTTCGTCACAGACGTGCATGCAT 
GGACGGACGGCGAGGGTGTGGACGTCGTGCTCGACTCGCTTTCGGGCGAG 

10 

CGCATCGACAAGAGCCTCATGGTCCTGCGCGCCTGTGGTCGCCTTGTGAA 
GCTGGGCAGGCGCGACGACTGCGCCGACACGCAGCCTGGGCTGCCGCCGC 
5 TCCTACGGAATTTTTCCTTCTCGCAGGTGGACTTGCGGGGAATGATGCTC 

15 GATCAACCGGCGAGGATCCGTGGGCTCCTCGACGAGCTGTTCGGGTTGGT 

CGCAGCCGGTGCCATCAGCCCACTGGGGTCGGGGTTGCGCGTTGGCGGAT 
CCCTCACGCCACCGCCGGTCGAGACCTTCCCGATCTCTCGCGCAGCCGAG 
GCATTCCGGAGGATGGCGCAAGGACAGCATCTCGGGAAGCTCGTGCTCAC 

^ 10 GCTGGACGACCCGGAGGTGCGGATCCGCGCTCCGGCCGAATCCAGCGTCG 

CCGTCCGCGCGGACGGCACCTACCTTGTGACCGGCGGTCTGGGTGGCCTC 
GGTCTGCGCGTGGCCGGATGGCTGGCCGAGCGGGGCGCGGGGCAACTGGT 
GCTGGTGGGCCGCTCCGGTGCGGCGAGCGCAGAGCAGCGAGCCGCCGTGG 

25 

CGGCGCTGGAGGCCCACGGCGCGCGCGTCACGGTGGCGAAAGCGGACGTC 
1 5 GCCGATCGGTCACAGATCGAGCGGGTCCTCCGCGAGGTTACCGCGTCGGG 
GATGCCGCTGCGGGGTGTCGTGCATGCGGCAGGTCTCGTGGATGACGGGC 
30 TGCTGATGCAGCAGACTCCGGCGCGGTTCCGCACGGTGATGGGACCTAAG 

GTCCAGGGGGCCTTGCACTTGCACACGCTGACACGCGAAGCGCCTCTTTC 
CTTCTTCGTGCTGTACGCTTCTGCAGCTGGGCTTTTCGGCTCGCCAGGCC 
2 0 AGGGCAACTATGCCGCAGCCAACGCGTTCCTCGACGCCCTTTCGCATCAC 
35 CGAAGGGCGCAGGGCCTGCCGGCGCTGAGCATCGACTGGGGCATGTTCAC 

GGAGGTGGGGATGGCCGTTGCGCAAGAAAACCGTGGCGCGCGGCAGATCT 
CTCGCGGGATGCGGGGCATCACCCCCGATGAGGGTCTGTCAGCTCTGGCG 
^ CGCTTGCTCGAGGGTGATCGCGTGCAGACGGGGGTGATACCGATCACTCC 
25 GCGGCAGTGGGTGGAGTTCTACCCGGCAACAGCGGCCTCACGGAGGTTGT 
CGCGGCTGGTGACCACGCAGCGCGCGGTCGCTGATCGGACCGCCGGGGAT 
CGGGACCTGCTCGAACAGCTTGCGTCGGCTGAGCCGAGCGCGCGGGCGGG 
45 GCTGCTGCAGGACGTCGTGCGCGTGCAGGTCTCGCATGTGCTGCGTCTCC 

CTGAAGACAAGATCGAGGTGGATGCCCCGCTCTCGAGCATGGGCATGGAC 
3 0 TCGCTGATGAGCCTGGAGCTGCGCAACCGCATCGAGGCTGCGCTGGGCGT 

50 



55 



WO 00/22139 



PCTAJS99/23S3S 



10 



25 



174 



CGCCGCGCCTGCAGCCTTGGGGTGGACGTACCCAACGGTAGCAGCGATAA 
CGCGCTGGCTGCTCGACGACGCCCTCGTCGTCCGGCTTGGCGGCGGGTCG 
GACACGGACGAATCGACGGCGAGCGCCGGTTCGTTCGTCCACGTCCTCCG 
CTTTCGTCCTGTCGTCAAGCCGCGGGCTCGTCTCTTCTGTTTTCACGGTT 
5 CTGGCGGCTCGCCCGAGGGCTTCCGTTCCTGGTCGGAGAAGTCTGAGTGG 
15 AGCGATCTGGAAATCGTGGCCATGTGGCACGATCGCAGCCTCGCCTCCGA 

GGACGCGCCTGGTAAGAAGTACGTCCAAGAGGCGGCCTCGCTGATTCAGC 
ACTATGCAGACGCACCGTTTGCGTTAGTAGGGTTCAGCCTGGGTGTCCGG 
TTCGTCATGGGGACAGCCGTGGAGCTCGCCAGTCGTTCCGGCGCACCGGC 
20 i c TCCGCTGGCCGTCTTCACGTTGGGCGGCAGCTTGATCTCTTCTTCAGAGA 

TCACCCCGGAGATGGAGACCGATATAATAGCCAAGCTCTTCTTCCGAAAT 
GCCGCGGGTTTCGTGCGATCCACCCAACAAGTCCAGGCCGATGCTCGCGC 
AGACAAGGTCATCACAGACACCATGGTGGCTCCGGCCCCCGGGGACTCGA 
AGGAGCCGCCCGTGAAGATCGCGGTCCCTATCGTCGCCATCGCCGGCTCG 
1 5 GACGATGTGATCGTGCCTCCGAGCGACGTTCAGGATCTACAATCTCGCAC 
CACGGAGCGCTTCTATATGCATCTCCTTCCCGGAGATCACGAATTTCTCG 
30 TCGATCGAGGGCGCGAGATCATGCACATCGTCGACTCGCATCTCAATCCG 

CTGCTCGCCGCGAGGACGACGTCGTCAGGCCCCGCGTTCGAGGCAAAATG 
ATGGCAGCCTCCCTCGGGCGCGCGAGATGGTTGGGAGCAGCGTGGGCGCT 
2 0 GGCGGCCGGCGGCAGGCCGCGGAGGCGCATGAGCCTTCCTGGACGTTTGC 
AGTATAGGAGATTTTATGACACAGGAGCAAGCGAATCAGAGTGAGACGAA 
GCCTGCTTTCGACTTCAAGCCGTTCGCGCCTGGGTACGCGGAGGACCCGT 
TCCCCGCGATCGAGCGCCTGAGAGAGGCAACCCCCATCTTCTACTGGGAT 
GAAGGCCGCTCCTGGGTCCTCACCCGATACCACGACGTGTCGGCGGTGTT 

2 5 CCGCGACGAACGCTTCGCGGTCAGTCGAGAAGAGTGGGAATCGAGCGCGG 

AGTACTCGTCGGCCATTCCCGAGCTCAGCGATATGAAGAAGTACGGATTG 
TTCGGGCTGCCGCCGGAGGATCACGCTCGGGTCCGCAAGCTCGTCAACCC 
45 GTCGTTTACGTCACGCGCCATCGACCTGCTGCGCGCCGAAATACAGCGCA 

CCGTCGACCAGCTGCTCGATGCTCGCTCCGGACAAGAGGAGTTCGACGTT 

3 0 GTGCGGGATTACGCGGAGGGAATCCCGATGCGCGCGATCAGCGCTCTGTT 
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GAAGGTTCCGGCCGAGTGTGACGAGAAGTTCCGTCGCTTCGGCTCGGCGA 
CTGCGCGCGCGCTCGGCGTGGGTTTGGTGCCCCAGGTCGATGAGGAGACC 
AAGACCCTGGTCGCGTCCGTCACCGAGGGGCTCGCGCTGCTCCATGACGT 
CCTCGATGAGCGGCGCAGGAACCCGCTCGAAAATGACGTCTTGACGATGC 
5 TGCTTCAGGCCGAGGCCGACGGCAGCAGGCTGAGCACGAAGGAGCTGGTC 
15 GCGCTCGTGGGTGCGATTATCGCTGCTGGCACCGATACCACGATCTACCT 

TATCGCGTTCGCTGTGCTCAACCTGCTGCGGTCGCCCGAGGCGCTCGAGC 
TGGTGAAGGCCGAGCCCGGGCTCATGAGGAACGCGCTCGATGAGGTGCTC 
CGCTTCGACAATATCCTCAGAATAGGAACTGTGCGTTTCGCCAGGCAGGA 
20 10 CCTGGAGTACTGCGGGGCATCGATCAAGAAAGGGGAGATGGTCTTTCTCC 

TGATCCCGAGCGCCCTGAGAGATGGGACTGTATTCTCCAGGCCAGACGTG 
TTTGATGTGCGACGGGACACGGGCGCGAGCCTCGCGTACGGTAGAGGCCC 
CCATGTCTGCCCCGGGGTGTCCCTTGCTCGCCTCGAGGCGGAGATCGCCG 

25 

TGGGCACCATCTTCCGTAGGTTCCCCGAGATGAAGCTGAAAGAAACTCCC 
1 5 GTGTTTGGATACCACCCCGCGTTCCGGAACATCGAATCACTCAACGTC AT 
CTTGAAGCCCTCCAAAGCTGGATAGCTCGCGGGGGTATCGCTTCCCGAAC 
30 CTCATTCCCTCATGATACAGCTCGCGCGCGGGTGCTGTCTGCCGCGGGTG 

CGATTCGATCCAGCGGACAAGCCCATTGTCAGCGCGCGAAGATCGAATCC 
ACGGCCCGGAGAAGAGCCCGTCCGGGTGACGTCGGAAGAAGTGCCGGGCG 

2 0 CCGCCCTGGGAGCGCAAAGCTCGCTCGTTCGCGCTCAGCACGCCGCTCGT 
35 CATGTCCGGCCCTGCACCCGCGCCGAGGAGCCGCCCGCCCTGATGCACGG 

CCTCACCGAGCGGCAGGTTCTGCTCTCGCTCGTCGCCCTCGCGCTCGTCC 
TCCTGACCGCGCGCGCCTTCGGCGAGCTCGCGCGGCGGCTGCGCCAGCCC 
GAGGTGCTCGGCGAGCTCTTCGGCGGCGTGGTGCTGGGCCCGTCCGTCGT 

40 

25 CGGCGCGCTCGCTCCTGGGTTCCATCGAGTCCTCTTCCAGGATCCGGCGG 
TCGGGGTCGTGCTCTCCGGCATCTCCTGGATAGGCGCGCTCGTCCTGCTG 
CTCATGGCGGGTATCGAGGTCGATGTGAGCATCCTGCGCAAGGAGGCGCG 
45 CCCCGGGGCGCTCTCGGCGCTCGGCGCGATCGCGCCCCCGCTGCGCACGC 

CGGGGCCGCTGGTGCAGCGCATGCAGGGCGCGTTCACGTGGGATCTCGAC 

3 0 GTCTCGCCGCGACGCTCTGCGCAAGCCTGAGCCTCGGCGCCTGCTCGTAC 
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ACCTCGCCGGTGCTCGCTCCGCCCGCGGACATCCGGCCGCCCGCCGCGGC 
CCAGCTCGAGCCGGACTCGCCGGATGACGAGGCCGACGAGGCCGACGAGG 
CGCTCCGCCCGTTCCGCGACGCGATCGCCGCGTACTCGGAGGCCGTTCGG 
TGGGCGGAGGCGGCGCAGCGGCCGCGGCTGGAGAGCCTCGTGCGGCTCGC 
5 GATCGTGCGGCTGGGCAAGGCGCTCGACAAGGTCCCTTTCGCGCACACGA 
CGGCCGGCGTCTCCCAGATCGCCGGCAGACTCCAGAACGATGCGGTCTGG 
TTCGATGTCGCCGCCCGGTACGCGAGCTTCCGCGCGGCGACGGAGCACGC 
GCTCCGCGACGCGGCGTCGGCCATGGAGGCGCTCGCGGCCGGCCCGTACC 
GCGGATCGAGCCGCGTGTCCGCTGCCGTAGGGGAGTTTCGGGGGGAGGCG 
10 GCGCGCCTTCACCCCGCGGACCGTGTACCCGCGTCCGACCAGCAGATCCT 
GACCGCGCTGCGCGCAGCCGAGCGGGCGCTCATCGCGCTCTACACTGCGT 
TCGCCCGTGAGGAGTGAGCCTCTCTCGGGCGCAGCCGAGCGGCGGCGTGC 
CGGTGGTTCCCTCTTCGCAACCATGACCGGAGCCGCGCTCGGTCCGCGCA 
GCGGCTAGCGCGCGTCGCGGCAGAGATCGCTGGAGCGACAGGCGACGACC 
1 5 CGCCCGAGGGTGTCGAACGGATTGCCGCAGCCCTCATTGCGGATCCCCTC 
CAGACACTCGTTCAGCTGCTTGGCGTCGATGCCGCCTGGGCACTCGCCGA 
AGGTCAGCTCGTCGCGCCACTCGGATCGGATCTTGTTCGAGCACGCGTCC 
TTGCTCGAATACTCCCGGTCTTGTCCGATGTTGTTGCACCGCGCCTCGCG 
GTCGCACCGCGCCGCCACGATGCTATCGACGGCGCTGCCGACTGGCACCG 
20 GCGCCTCGCCCTGCGCGCCACCCGGGGTTTGCGCCTCCCCGCCTGACCGC 
TTTTCGCCGCCGCACGCCGCGAGCAGGCTCATTCCCGACACCGAGATCAG 
GCCCACGACCAGCTTCCCAGCAATCTTTTGCATGGCTTCCCCTCCCTCAC 
GACACGTCACATCAGAGACTCTCCGCTCGGCTCGTCGGTTCGACAGCCGG 
CGACGGCCACGAGCAGAACCGTCCCCGACCAGAACAGCCGCATGCGGGTT 
2 5 TCTCGCAACATGCCCCGACATCCTTGCGACTAGCGTGCCTCCGCTCGTGC 
CGAGATCGGCTGTCCTGTGCGACGGCAATATCCTGCGATCGGCCGGGCAG 
GAGGTACCGACACGGGCGCCGGGCGGGAGGTGCCGCCACGGGCTCGAAAT . 
GTGCTGCGGCAGGCGCCTCCATGCCCGCAGCCGGGAACGCGGCGCCCGGC 
CAGCCTCGGGGTGACGCCGCAAACGGGAGATGCTCCCGGAGAGGCGCCGG 
30 GCACAGCCGAGCGCCGTCACCACCGTGCGCACTCGTGAGCTCCAGCTCCT 
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CGGCATAGAAGAGACCGTCACTCCCGGTCCGTGTAGGCGATCGTGCTGAT 
CAGCGCGTTCTCCGCCTGACGCGAGTCGAGCCGGGTATGCTGCACGACAA 
TGGGAACGTCCGATTCGATCACGCTGGCATAGTCCGTATCGCGCGGGATC 
GGCTCGGGTTCGGTCAGATCGTTGAACCGGACGTGCCGGGTGCGCCTCGC 
5 TGGGACGGTCACCCGGTACGGCCCGGCGGGGTCGCGGTCGCTGAAGTAGA 
75 CGGTGATGGCGACCTGCGCGTCCCGGTCCGACGCATTCAACAGGCAGGCC 

GTCTCATGGCTCGTCATCTGCGGCTCGGGTCCGTTGCTCCGGCCTGGGAT 
GTAGCCCTCTGCGATTGCCCAGCGCGTCCGCCCGATCGGCTTCTCCATAT 
GTCCTCCCTGCTGGCTCCTCTTTGGCTGCCTCCCTCTGCTGTCCAGGAGC 
20 io GACGGCCTCTTCTCCCGACGCGCTCGGGGATCCATGGCTGAGGATCCTCG 

CCGAGCGCTCCTTGCCGACCGGCGCGCCGAGCGCCGACGGGCTTTGAAAG 
CACGCGACCGGACACGTGATGCCGGCGCGACGAGGCCGCCCCGCGTCTGA 
TCCCGATCGTGACATCGCGACGTCCGCCGGCGCCTCTGCAGGCCGGCCTG 

25 

AGCGTTGCGCGGTCATGGTCGTCCTCGCGTCACCGCCACCCGCCGATTCA 
15 CATCCCACCGCGGCACGACGCTTGCTCAAACCGCGGCGAGACGGCCGGGC 
GGCTGTGGTACCGGCCAGCCCGGACGCGAGGCCCGAGAGGGACAGTGGGT 
30 CCGCCGTGAAGCAGTGAGGCGATCGAGGTGGCAGATGAAACACGTTGACA 

CGGGCCGACGAGTCGGCCGCCGGATAGGGCTCACGCTCGGTCTCCTCGCG 
AGCATGGCGCTCGCCGGCTGTGGCGGCCCGAGCGAGAAAATCGTGCAGGG 
2 0 CACGCGGCTCGCGCCCGGCGCCGATGCGCACGTCGCCGCCGACGTCGACC 
CCGACGCCGCGACCACGCGGCTGGCGGTGGACGTCGTTCACCTCTCGCCG 
CCCGAGCGCATCGAGGCCGGCAGCGAGCGGTTCGTCGTCTGGCAGCGTCC 
GAGCTCCGAGTCCCCGTGGCAACGGGTCGGAGTGCTCGACTACAACGCTG 
CCAGCCGAAGAGGCAAGCTGGCCGAGACGACCGTGCCGCATGCCAACTTC 

2 5 GAGCTGCTCATCACCGTCGAGAAGCAGAGCAGCCCTCAGTCTCCATCTTC 
TGCCGCCGTCATCGGGCCGACGTCCGTCGGGTAACATCGCGCTATCAGCA 
GCGCTGAGCCCGCCAGCAGGCCCCAGAGCCCTGCCTCGATCGCCTTCTCC 

45 ATCATATCATCCCTGCGTACTCCTCCAGCGACGGCCGCGTCGAAGCAACC 

GCCGTGCCGGCGCGGCTCTACGTGCGCGACAGGAGAGCGTCCTGGCGCGG 

3 0 CCTGCGCATCGCTGGAAGGATCGGCGGAGCATGGAGAAAGAATCGAGGAT 
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CGCGATCTACGGCGCCATCGCAGCCAACGTGGCGATCGCGGCGGTCAAGT 
TCATCGCCGCCGCCGTGACCGGCAGCTCGGCGATGCTCTCCGAGGGCGTG 

10 

CACTCCCTCGTCGATACTGCAGACGGGCTCCTCCTCCTGCTCGGCAAGCA 
CCGGAGCGCACGCCCGCCCGACGCCGAGCATCCGTTCGGCCACGGCAAGG 
5 AGCTCTATTTCTGGACGCTGATCGTCGCCATCATGATCTTCGCCGCGGGC 
15 GGCGGCGTCTCGATCTACGAAGGGATCTTGCACCTCTTGCACCCGCGCCA 

GATCGAGGATCCGACGTGGAACTACGTCGTCCTCGGCGCAGCGGCCGTCT 
TCGAGGGGACGTCGCTCATCATCTCGATCCACGAGTTCAAGAAGAAGGAC 
GGACAGGGCTACCTCGCGGCGATGCGGTCCAGCAAGGACCCGACGACGTT 

20 

10 CACGATCGTCCTGGAGGACTCCGCGGCGCTCGCCGGGCTCACCATCGCCT 
TCCTCGGCGTCTGGCTCGGGCACCGCCTGGGAAACCCCTACCTCGACGGC 
GCGGCGTCGATCGGCATCGGCCTCGTGCTCGCCGCGGTCGCGGTCTTCCT 
CGCCAGCCAGAGCCGTGGGCTCCT CGTGGGGGAGAGCGCGGAC AGGGAGC 

25 

TCCTCGCCGCGATCCGCGCGCTCGCCAGCGCAGATCCTGGCGTGTCGGCG 
1 5 GTGGGGCGGCCCCTGACGATGCACTTCGGTCCGCACGAAGTCCTGGTCGT 

GCTGCGCATCGAGTTCGACGCCGCGCTCACGGCGTCCGGGGTCGCGGAGG 
30 CGAGGGAGCGCATCGAGACCCGGATACGGAGCGAGCGACCCGACGTGAAG 

CACATCTACGTCGAGGCCAGGTCGCTCCACCAGCGCGCGAGGGCGTGACG 

CGCCGTGGAGAGACCGCGCGCGGCCTCCGCCATCCTCCGCGGCGCCCGGG 
2 0 CTCAGGTGGCCCTCGCAGCAGGGCGCGCCTGGCGGGCAAACCGTGCAGAC 

35 

GTCGTCCTTCGACGCGAGGTACGCTGGTTGCAAGTCGTCACGCCGTATCG 
CGAGGTCCGGCAGCGCCGGAGCCCGGGCGGGCCGGGCGCACGAAGGCGCG 
GCGAGCGCAGGCTTCGAGGGGGGCGACGTCATGAGGAAGGCCAGGGCGCA 
TGGGGCGATGCTCGGCGGGCGAGATGACGGCTGGCGTCGCGGCCTCCCCG 

40 

2 5 GCGCCGGCGCGCTTCGCGCCGCGCTCCAGCGCGGTCGCTCGCGCGATCTC 

GCCCGGCGCCGGCTCATCGCCTCCGTGTCCCTCGCCGGCGGCGCCAGCAT 
GGCGGTCGTCTCGCTGTTCCAGCTCGGGATCATCGAGCGCCTGCCCGATC 
45 CTCCGCTTCCAGGGTTCG ATTCGGC CAAGGTGACGAGCTCCGATATCGCG 

TTCGGGCTCACGATGCCGGACGCGCCGCTCGCGCTCACCAGCTTCGCGTC 

3 0 CAACCTCGCGCTGGCTGGCTGGGGAGGCGCCGAGCGCGCCAGGAACACCC 
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CCTGGATCCCCGTCGCCGTGGCGGCCAAGGCGGCCGTCGAGGCGGCCGTG 
TCCGGATGGCTCCTCGTCCAGATGCGACGGCGGGAGAGGGCCTGGTGCGC 

10 

GTACTGCCTGGTCGCCATGGCGGCCAACATGGCCGTGTTCGCGCTCTCGC 
TCCCGGAAGGGTGGGCGGCGCTGGGGAAGGCGCGAGCGCGCTCGTGACAG 
5 GACGGGCGCGGGCAGCCCCGGCCATCGGAGGCCGGCGTGCACCCGCTCCG 

15 TCACGCCCCAGCCCGCGCCGCGTGATCTCCCGCGGACAGGQCGCGTACCG 

TGGACCCCGCACGCGCCGCGTCGACGGACATCCCCGGCGACCCGCGCGGC 
GCGACCCGCGCAACTCCGGCCCGCCGCCGGGCATCGACATCTCCCGTGAG 
CAAGGGCACTCCGCTCCTGCCCGCGTCCGCGAACGATGGCTGCGCTGTTT 

20 10 CCACCCTGGAGCAACTCCGTTTACCGCGTGGCGCTCGTCGGGCTCGTCGC 

CTCGGCGGGCGGCGCCATCCTCGCGCTCATGATCTACGTCCGCACGCCGT 
GGAAGCGATACCAGTTCGAGCCCGTCGATCAGCCGGTGCAGTTCGATCAC 
CGCCATCACGTGCAGGACGACGCCATCGATTGCGTCTACTGCCACACCAC 

25 

GGTGACCCGCTCGCCCACGGCGGGGATGCCGCCGACGGCCACGTGCATGG 
1 5 GGTGCCACAGCCAGATCTGGAATCAGAGCGTCATGCTCGAGCCCGTGCGG 
CGGAGCTGGTTCTCCGGCCACGCCGATCCCGTGGAACCGGGTGAAACTCC 
30 GTGCCCGACTTCGTCTATTTCAACCACGCGATCCACGTGAACAAGGGCGT 

GGGCTGGCGTGAAGCTGCCACGGGCGCGTGGACGAGATGGCGGCCGTCTA 
CAAGGTGGCGCCGATGACGATGGGCTGGTGCCTGGAGTGCCATCGCCTGC 
2 0 CGGAGCCGCACCTCCGCCCGCTCTCCGCGATCACCGACATGCGCTGGGAC 
CCGGGGGAGCGGAGGGATGAGCTCGGGGCGCAGCTCGCGAAGGAATACGG 
GGTCCGGCGGCTCACGCACTGCACAGCGTGCCATCGATGAACGATGAACA 
GGGGATCTCCTTGAAAGACGCAGATGAGATGAAGGAATGGTGGCTAGAAG 
CGCTCGGGCCGGCGGGAGAGCGCGCGTCCTACAGGCTGCTGGCGCCGCTC 

2 5 ATCGAGAGCCCGGAGCTCCGCGCGCTCGCCGCGGGCGAACCGCCCCGGGG 
CGTGGACGAGCCGGCGGGCGTCAGCCGCCGCGCGCTGCTCAAGCTGCTCG 
GCGCGAGCATGGCGCTCGCCGGCGTCGCGGGCTGCACCCCGCATGAGCCC 

45 GAGAAGATCCTGCCGTACAACGAGACCCCGCCCGGCGTCGTGCCGGGTCT 

CTCCCAATCCTACGCGACGAGCATGGTGCTCGACGGGTATGCCATGGGCC 

3 0 TCCTCGCCAAGAGCTACGCGGGGCGGCCCATCAAGATCGAGGGCAACCCC 
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GCGCACCCGGCGAGCCTCGGCGCGACCGGCGTCCACGAGCAGGCCTCGAT 
CCTCTCGCTGTACGACCCGTACCGCGCGCGCGCGCCGACGCGCGGCGGCC 
AGGTCGCGTCGTGGGAGGCGCTCTCCGCGCGCTTCGGCGGCGACCGCGAG 
GACGGCGGCGCTGGCCTCCGCTTCGTCCTCCAGCCCACGAGCTCGCCCCT 
CATCGCCGCGCTGATCGAGCGCGTCCGGCGCAGGTTCCCCGGCGCGCGGT 
TCACCTTCTGGTCGCCGGTCCACGCCGAGCACGCGCTCGAAGGCGCGCGG 
GCGGCGCTCGGCCTCAGGCTCTTGCCTCAGCTCGACGTCGACCAGGCCGA 
GGTGATCCTCTCGCTGGACGCGGACTTCCTCGCGGACATGCCGTTCAGCG 
TGCGCTATGCGCGCGACTTCGCCGCGCGCCGCCGCCCCGCGAGCCCGGCG 
GCGGCCATGAGCCGCCTCTACGTCGCGGAGGCGATGTTCACGCCCACGGG 
GACGCTCGCCGACCACCGGCTCCGCGTGCGGCCCGCCGAGGTCGCGCGCG 
TCGCGGCCGGCGTCGCGGTGGGAACTCGTGCACGAGTCTTTGTCTTGCGC 
CCTGTCCGGGAATAACGGACACCTTATCGCGGGTCGCTCTTTGTGCGCGG 
CTTCTGTACCTCTCAGGACAGGTAGAAGAGGGACTCAGGGGCCCTTATGT 
TAACTGGGGATGCCTTCGGGACGGCCGCAAATATATCCTATCACCTCACT 
GGGTGTGGGGGAGCACCGCGAGGATGTACAACCTCTGTAACTCTATGTGA 
GATAATGTGTGCAGTGATCTGAGACTTATTTGTGTGACCGAGACGTCTCT 
CTTATTGGTACGCATAGTATAATATAACACGTCTCATAGATACTCCCGAC 
ATATCCGCGGTATGCGCGCACATAGAATAGGTGATGATAAATCCCTAGTG 
TGTGGAACTAGAAGATGCGGGAGTTACCTGATATTTACGGAAAAAGTATT 
ATCTCAACTACCTCTCTGTTGAGACTATCACTTCGGTGTCGTTGTGCTGC 
TGGT, 

or its complementary strand, 

(b) DNA- sequences which hybridise under stringent conditions 
to regions of DNA- sequences according to (a) encoding proteins 
or to fragments of said DNA- sequences, 
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(c) DNA- sequences which hybridise to the DNA- sequences accord- 
ing to (a) and (b) because of a degeneration cf the genetic 
code, 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA- sequences according to (a) to (c) , wherein the 
variations and mutants offer isof unctional expression products. 

11. Peptide encoded by a DNA sequence according to claim 10 
selected from the group consisting of 

Seq ID No 83 

>Contig56_003 2890 amino acids MW=307428 D pl = 5 . 76 numambig=13 

IRPRAAAVPMRSTVTGGVIAGPELGASYWADNLRQPVRFAAAAQALLEGGPALFIEMSPH 

PILVPPLDEIQTAAEQGGAAVGSLRRGQDERATLLEALGTLWASGYPVSWARLFPAGGRR 

VPLPTYPWQHERCWIEVEPDARRLAAADPTKDWFYRTDWPEVPRAAPKSETAHGSWLLLA 

DRGGVGEAVAAALSTRGLSCTVLHASADASTVAEQVSEAASRRNDWQGVLYLWGLDAVVD 

AGASADEVSEATRRATAPVLGLTOFLSAAPHPPRFWVVTRGACTVGGEPEASLCQAALWG 

LARVAALEHPAAWGGLVDLDPQKSPTEIEPLVAELLSPDAEDQLAFRSGRRHAARLVAAP 

PEGDVAPISLSAEGSYLVTGGLGGLGLLVARWLVERGARHLVIiTSRHGLPERQASGGEQP 

PEARAR I AAVEGLEAQGARVTVAAVDVAE AD PMTALLAA I EPPLRG WHAAGVF P VRHLA 

ETDEALLES VLRPKVAGSWLLHRLLRDRPLDLFVLFS SGAAVWGGKGQGAYAAANAFLDG 

I^HRRAHSLPALSLAWGLWAEGGMVI)AKAHARLSDIGVLPMATGPALSALERLVNTSAV 

QRSVTRMDWARFAPVYAARGRRNLLSALVAEDERAASPPVPTANRIWRGLSVAESRSALY 

ELVRGIVARVLGFSDPGAIJ3VGRGFAEQGLDSLMALEIRNRLQRELGERLSATLAFDHPT 

VERLVAHIJjTDVljKLEDRSDTRHIRSVAADDDIAIVGAACRFPGGDEGLETYWRHLAEGM 

WSTEVPADRWRAADWYDPDPEVFGRTYVAKGAFLRDVRSLDAAFFAI S PREAMSLDPQQ 

RLLLEVSWEAIERAGQDPMALRESATGVFVGMIGSEHAERVQGLDDDAALLYGTTGNLLS 

VAAGRLSFFLGLHGPTMTVDTACSSSLVALHLACQSLRLGECDQALAGGSSVLLSPRSFV 
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AASRMRLLSPDGRCKTFSAAADGFARAEGCAVWLKRLRDAQRDRDPILAVVRSTAINHD 
GPSSGLTVPSGPAQQALLRQALAQAGVAPAEVDFVECHGTGTALGDPIEVQALGAVYGRG 
RPAERPLWLGAVKANLGHLEAAAGLAGVLKVLLALEHEQ I PAQPELDELNPH I P WAELPV 
AWRRAVPWPRGARPRRAGVSAFGLSGTNAHWLEEAPAVEPVAAAPERAAELFVLSAKS 
AAALDAQAARLRDHLE KHVELGLGD VAFS LATTRS AMEHRLAVAAS S REALRGALS AAAQ 
GHTPPGAVRGRASGGSAPKVVFVFPGQGSQWGMGRKLMAEEPVFRAALEGCDRAIEAEA 
GWSLLGELSADEAASQLGRIDWQPVLFAMEVALSALWRSWGVEPEAWGHSMGEVAAAH 

VAGALS LEDAVAI I CRRSRLLRR 1 5 GQGEMALVELSLEEAEAALRGHEGRLS VAVSNS PR 

STVU^GEPAALSEVLAALTAKGVFVJRQVKVDVASHSPQVDPLREELIAALGAIRPRAAAV 

PMRSTVTGGVIAGPELGASYWADKLRQPVRFAAAAQALLEGGPALFIEMSPHPILVPPLD 

EIQTAAEQGGAAVGSLRRGQDERATLLEALGTLWASGYPVSWARLFPAGGRRVPLPTYPW 

QHERYWIEDSVHGSKPSLRLRQLRNGATDHPLLGAPLljVSARPGAHLWEQALSDERLSYL 

SEHRVHGEAVLPSAAYA^MALAAGVDLYGTATLVLEQLALERALAVPSEGGRIVQVALSE 

EGPGRASFQVSSREEAGRSWVRHATGHVCSGQSSAVGALKEAPWEIQRRCPSVLSSEALY 

PLLNEHALDYGPCFQGVEQVWLGTGEVLGRVRLPGDMASSSGAYRIHPALLDACFQVLTA 

LLTTPESIEIRRRLTDLHEPDLPRSRAPVNQAVSDTWLWDAALDGGRRQSASVPVDLVLG 

S FHAKWEVMERLAQAY 1 1 GTLR I V7NVFCAAGERHT IDELLVRLQ ISWYRKVI KRWMEHL 
VAIGILVGDGEHFVSSQPLPEPDLAAVLEEAGRVFADLPVLFEWCKFAGERLADVLTGKT 

LALE I LFPGGS FDMAER I YRDS P I AR Y SNGI VRGWES AARWAPSGMF S I LE I GAGTGA 
TTAAVLPVLLPDRTEYHFTDVSPLFLARAEQRFRDYPFLKYGILDVDQEPAGQGYAHQRF 
DVIVAANVIHATRDIRATAKRLLSLLAPGGLLVLVEGTGHPIWFDITTGLIEGWQKYEDD 

LRI DHPLLPARTWCDVLRRVGFAC AVSLPGDGS PAG I LGQHV I LSRAPG I AGAACDS SGE 

SATESPAARAVRQEWADGSADVVHRMALERMYFHRRPGRQVWVHGRLRTGGGAFTKALAG 

DLLLFEDTGQWAEVQGLRLPQLEASAFAPRDPREEWLYALEWQRKDPIPEAPAAASSSS 

AGAWLVLMDQGGTGAALVSLLEGRGEACVRVIAGTAYACLAPGLYQVDPAQPDGFHTLLR 

DAFGEDRICRAWHMWSLDATAAGERATAESLQADQLLGSLSALSLVQALVRRRWRNMPR 

LWLLTRAVHAVGAEDAAASVAQAPVWGLGRTLALEHPELRCTLVDVNPAPSPEDAAALAV 

ELGASDREDQ VALRSDGR Y VARLVRS S FSGKPATDCG I RADGS YVI TDGMGR VGLS VAQW 

MVMQGARHWLVDRGGAS EASRDALRSMAEAGAEVQ I VEADVARRDDVARLLS KI E P SMP 

PLRGIVYVDGTFQGDSSMLELDARRFKEWMYPKVLGAWNLHALTRDRSLDFFVLYSSGTS 
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LLGLPGQGSRAAGDAFLDAIAHHRCKVGLTAMSINWGLLSEASSPATPNDGGARLEYRGM 

EGLTLEQGAAALGRIJjARPRAQVGVMRLNLRQWLEXXXXXXXXXXXXXWYNLLI I IQYTK 
VPFQGPFRML* 

Seq ID No 84 

>Contig56_02 7 700 amino acids MW=80569 D pl=7.02 numambig=0 
MNMELNITSKSNPFGDTTAENDKKMLSNAFIETADFRTLIETDDRTIWGRRGTGKSALF 
IQLNEHWKKDKKILILSFSPDDSQIIGFRSMLKPFTGSFNLARAATRLLWRYAMliMEIAS 
YISSHYKLSSQISSETLLNEHLKKWNSAQGDILRKCRLVAKEYLDENNPEESIGDLQFNL 
NISEI ENNI VSLLERSDRKWI LMDKLDEAYE PDN IG I G 1 1 AGLAYAS I ELNQKAKCI RP 

IIFLRDNIFRSLSKEDPDYSRNIEGQVIRLHWDWAQLLMLSAKRMKVAFKLDIEKDQRVW 
DRCTADDLKGRNGFKRCLQFTLYRPRDLLSLLNEAFFSAFRENRETI INTDLEYAAKSI S 
MARLEDLWKEYQKIFPSIQVITSAFRSIEPELTVYTCLKKIEASFELIEENGDPKITSEI 
QLLKASGILQSLYSVGFVGIRDXNTSSYSFCHDGRTPDKGFESNEKLLIHPCYWLGLNLN 
RNALAPEEAEEINDEYDINI I SDNSAI RNKTIGQITTHLDQI PIGNEGATEFEQWCLDAL 
RI VFASHLTDI KSHPNGNAVQRRDI IGTNGGKSDFWKRVLEDYKTRQWFDAKNFEELGP 
SEYRQLQSYLTGPYGKLGFIINRDESEVLKSGKDLDWTKEMYQSHNSLI IKLPAKYISKL 
LQKLRNPEKHDAIDRQMGKLLTLYETSYMAIKSTQKKRRK* 

Seq ID No 85 

>Contig57_001 372 amino acids MW=38411 D pl=12.39 numambig=10 
M LT S XXXXXXXXXXL LA YR CAT AARG AGR I RHHRQ Y AQ RRRRTWL Y ARAAGTL P D RRHG 
LLWARGHPPCLPQPARSRERSRAGGRRQHAPFVQDDDNAGAHPGAVARWPLPD I RRLGQ 
RVR PWGGLRYGRAQTALRR PATRRSDLGS D PGFGHESGWPVDRVDGTQCAR S GGALARGA 
AERSRRRRGHRLCRDPRNGDLARRPDRGRGAACRVGAGAGRWEPLRAGRSEDKPRPPGGR 
CRRGGFDQGGAGSAPRTDPAKPFFPHAQSADPDRGDRARAGDGAGAVAAGGPTALRGGER 
VRPQRHQRPCRAGGGAGHGARTGDAGALSGAFGAVGEERRRAGRTGGAALSAHRRVPGAG 
SRRRRVQPGIDA* 
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Seq ID No 86 

>Cont ig57_002 225 9 amino acids MW=238258 D pl=5.92 nuTnambig=0 

MS YTLGLQGPCLTVDTVC S S SLVAI HLACRSLRARESDLALAGGVNMLLS S KTM I MLGR I 

QALSPDGHCRTFDASANGFVRGEGCGMWLKRLSDAQRHGDRIWALIRGSAMNQDGRSTG 

LMAPWLAQEALLREALQSARVDAGAIGYVETHGTGTSLGDPIEVEALRAVLGPARADGS 

R C V LGA V KTN LGH LEGAAGVAGL I KAALALHHELI PRNLHFHTLNPRI RI EGTALALATE 

PVPWPRAGRPRFAGVSAFGLSGTNVHVVLEEAPATVLAPATPGRSAELLVLSAKSAAALD 

AQAARLS AH I AAYPEQGLGDVAF S LVSTRS PMEHRLAVAATS REALR S ALEVAAQGQTPA 

GAARGRAASSPGKLAFLFAGQGAQVPGMGRGLWEAWPAFRETFDRCVTLFDRELHQPLCE 

VMWAEPGSSRSSLLDQTAFTQPALFALEYALAALFRSWGVEPELVAGHSLGELVAACVAG 

VFSLEDAVRLVVARGRLMQALPAGGAMVS I AAPEADVAAAVAPHAALVS IAAVNGPEQW 

IAGAEKFVQQI AAAFAARGARTKPLHVSHAFHS PLMDPMLEAFRRVTESVTYRRPS I ALV 

SNLSGKPCTDEVSAPGYWVRHAREAVRFADGVKALHAAGAGLFVEVGPKPTLLGLVPACI, 

PDARPVLLPASRAGRDEAASALEALGGFWWGGSVTWSGVFPSGGRRVPLPTYPWQRERY 

WTEAPVDREADGTGRARAGGHPLLGEVFSVSTHAGLRLWETTLDRKRLPWLGEHRAQGEV 

VFPGAGYLEMALS SGAE I LGDGP I QVTDWLI ETLTFAGDTAVPVQ WTTEERPGRLRFQ 

VASREPGERRAPFRIHARGVLRRIGRVETPARSNLAALRARLHAAVPAAAIYGALAEMGL 

QYGPALRGLAELWRGEGEALGRVRLPEAAGSATAYQLHPVLLDACVQMIVGAFADRDEAT 

PWAPVEVGSVRLFQRSPGELWCHARWSDGQQASSRWSADFELMDGTGAVVAEISRLVVE 

RLASGVRRRDADDWFLELDWEPAALGGPKITAGRWLLLGEGGGLGRSLCSALKAAGHVW 

HAAGDDTSTAGMRALLANAFDGQAPTAWHLSSLDGGGQLGPGLGAQGALDAPRSPDVDA 

DALESALMRGCDSVLSLVQALVGMDLRNAPRLWLLTRGAQAAAAGDVSWQAPLLGLGRT 

IALEHAELRCISVDLDPAEPEGEADALLAELLADDAEEEVALRGGDRLVARLVHRLPDAQ 

RREKVEPAGDRPFRLEIDEPGALDQLVLRATGRRAPGPGEVEISVEAAGLDSIDIQLALG 

VAPNDLPGEEIEPLVLGSECAGRIVAVGEGVNGLWGQPVIALAAGVFATHVTTSATLVL 

PRPLGLSATEAAAMPLAYLTAWYALDKVAHLQAGERVLIHAEAGGVGLCAVRWAQRVGAE 

VYATADTPENRAYLESLGVRYVSDSRSGRFVTDVHAWTDGEGVDWLDSLSGERIDKSLM 

VLRACGRLVKLGRRDDCADTQPGLPPLLRNFSFSQVDLRGMMLDQPARIRALLDELFGLV 

AAGAISPLGSGLRVGGSLTPPPVETFPISRAAEAFRRMAQGQHLGKLVLTLDDPEVRIRA 

P AE S S VAVRADGTYLVTGGLGGLGLRVAGWLAERGAGQLVLVGRS GAAS AEQRAAVAALE 
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AHGARVTVAKADVADRSQ I ERVLRE VTASGMPLRG WHAAGLVDDGLLMQQTPARFRTVM 
GPKVQGALHLHTLTREAPLSFFVLYASAAGLFGSPGQGNYAAANAFLDALSHHRRAQGLP 
ALS I DWGMFTEVGMAVAQ ENRGARQ I SRGMRG I TPDEGLS ALARLLEGDRVQTGV IPITP 
RQWVEFYPATAASRRLSRLVTTQRAVADRTAGDRDLLEQLASAEPSARAGLLQDWRVQV 
SH VLRLPEDKI E VDAPLS SMGMDSLMSLELRNR I EAALGVAAPAALGWTY PTVAA ITRWL 
LDDALWRLGGGSDTDESTASAGSFVHVLRFRPWKPRARLFCFHGSGGSPEGFRSWSEK 
SEWSDLE I VAMWHDRSLAS EDAPGKKYVQEAAS L I QH YADAPFALVGFSLGVRFVMGTAV 
ELASRSGAPAPLAVFTLGGSLISSSEITPEMETDIIAKLFFRNAAGFVRSTQQVQADARA 
DKVITDTMVAPAPGDSKEPPVKIAVPIVAIAGSDDVIVPPSDVQDLQSRTTERFYMHLLP 
GDHEFLVDRGREIMHIVDSHLNPLLAARTTSSGPAFEAK* 

Seq ID No 87 

>Contig57_027 419 amino acids MW=46737 D pl=5.09 numambig=0 
MTQEQANQSETKPAFDFKPFAPGYAEDPFPAIERLREATPIFYWDEGRSWVLTRYHDVSA 
VFRDERFAVSREEWESSAEYSSAIPELSDMKKYGLFGLPPEDHARVRKLVNPSFTSRAID 
LLRAE I QRTVDQLLDARSGQEEFD WRD YAEG I PMRAI S AL LKVPAECDEKFRRFGS ATA 
RALGVGLVPQVDEETKTLVASVTEGUUjLHDVLDERRRNPLENDVLTMLLQAEADGSRLS 
TKELVALVGAI I AAGTDTTI YLI AFAVLNLLRSPEALELVKAEPGLMRNALDEVLRFDNI 
LRIGTVRFARQDLEYCGASIKKGEMVFLLIPSALRDGTVFSRPDVFDVRRDTGASLAYGR 
GPHVCPGVSLARLEAEIAVGTIFRRFPEMKLKETPVFGYHPAFRNIESLNVILKPSKAG* 

Seq ID No 88 

>Contig57_043 492 amino acids MW=52617 D pl=11.54 numambig=0 
MAARARKSCRARGSRPAPMRTSPPTSTPTPRPRGWRWTS FTSRRPSASRPAASGSSSGSV 
RAPSPRGNGSECSTTTLPAEEASWPRRPCRMPTSSCSSPSRSRAALSLHLLPPSSGRRPS 
GNIAL.SAALSPPAGPRALPRSPSPSYHPCVLLQRRPRRSNRRAGAALRARQESVLARPAH 
RWKDRRSMEK£SRIAIYGAIAANVAIAAVKFIAAAVTGSSAMLSEGVHSLVDTADGLLLL 
LGKHRSARPPDAEHPFGHGKELYFWTLI VAI MI FAAGGGVS I YEGI LHLLHPRQI EDPTW 
NYWLGAAAVFEGTSL IIS IHEFKKKDGQGYLAAMRS SKDPTTFT I VLEDSAALAGLT I A 
FLGVWLGHRLGNPYLDGAASIGIGLVLAAVAVFLASQSRGLLVGESADRELLAAIRALAS 



WO 00/22139 



PCI7US99/23535 



186 

ADPGVSAVGRPLTMHFGPHEVLVVLRIEFDAALTASGVAEARERIETRIRSERPDVKHIY 
VEARS LHQRARA * 

12. DNA sequence according to any of claims 1 to 5 wherein the 
DNA is selected from the group consisting of 

(a) the following DNA sequences: 

Seq ID No 89 (>ContiglO) 

GGTAGTGAAATATGCTGTATTCAACAGAAAGCTTGATGAATTGATCTAGA 

AAGTAGAGCGAGAGAATCAAGTAAGATAGTAGGATGCATTATAAATATAG 

AATATATACTGCATACGATGACAGCATGCGCACGAATAGAATGCATAAGA 

GGCAAGCCAATAACCAAAAGTGGAGCCAGAGGAGATAGTCTCGCCAGTAG 

AAATAATGCTCAGCCAAGCGAGGTTGGACATATCAGTTCCAGAGTAGGTC 

TCAACCCCGTATATGAGTCCAATGAAGCCTGTCTCATCCAGTTAACGGCC 

TTTTGAGCAGAGAATCCTCCCTATTTTCGGAGAGGACGCGTCGAATATAA 

AGCAGGTCCAAAGAAGCAAGCAATAGCCAAAAGTTTGAAAGGTTAGTACG 

AGCAGCGGCTGGAGGACACTATGGTCGTGCAACGGGGGTAAAGGGTTTCA 

CGTATTGTAGCAGAGCACGTCAGAGGGTTATTCGTGACATTCGAGGCCAA 

CGAGGCGGTAGGACTTCGTAAGCGCATGACCATCCCGGTCACAAACGTAG 

TGCGGAGCGCCTCGTCACGCTCAACAAGGCCCTAGAACGCGCGGCGCAGA 

TCGACCCTTTTAAACGCCGGCACCGAGCCGGACCGTCCTGCCCAGGTTGT 

AAAGCGCTCCATCGGCCGACTTATGGCACTCGAGCCAAATCGCCCGGTTC 

CCCATCGGTCAGCGCAAACGGCCCCCCCGGGCGTCGCCACCCGCGGCGAC 

GAGGGGCCGTCCAGACGGGTGATCTCTCTCGTGAGCTCGCGGAGAGAGCC 

TCCTCGCAAGATCGATGTCAGCGGGATCGCGCGCCCCGTCCGCACCTGAA 

ACGCGTGCTGGAGCTCGACGGCAGCGAGGGAGTCGAGGCCGAACCGCGAT 

ATCGGCAGCGCGTCGTCGATCTGCCCGGCGTCCAGACGAAGCGCGCGGGC 

GAGGGTCGAGCGCAGCGCGTCCAGCAGGCTCCGGCCGGAGGGCTCCTCGG 

TCTCCGGGGGCGCGTCGTCCGGGGGCGAGGCGTCGTCGAGGAGCTCCGGC 
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GCGAACGCGACGTGGCGCTCGCCGAGCGCGTCCTCGAGAAAGGCGCGCCG 
GCACTCCCTCCGGCGGACCTTCCCGCTCGACGTCTTCGGCAGCGCGCCCG 
GCGCGATCAGCGCGACGGCGTGCGCGACGAGCTGGTGCTCGGCGGTCACC 
GCCTCGCGCACGGCCGCCACGATCTCGCGCGGATCCGCGGCCACGCGCGG 
5 GTCGACCTCGCACACCACGGCGAGGCGCTCCTCGCCCTCGTGCTCCACGG 
15 AGAACGCGGCGCTGCAGCCCGGCCGGACGGCGCGATGGCTGCTCTCGACG 

GTCTTCTCGATGTCCTGCGGGAAGTGGTTGCGGCCTCGAAGGATGATGAG 
GTCCTTCGACCTCCCCACCACGAACAGCTCGCCGCCCCGGAGGAAGCCGA 
GATCTCCCGTGCGCAGGTAGCGCGGCGCCGCGCTGCCAGCGAGCGTGGCC 
20 10 CCGAACGTGGCCTCCGTCTCCTCCGGGCGCCCCCAGTAGCCGACGGCTAC 

GCTGGGCCCGGACACCCAGATCTCCCCGATCTCCCCCGGCCCGAGCTCGT 
TCCCCGCGGGATCGACGATCGCGACCGCCCGCGGATCGAGCGCCCGACCG 
CTGCCGACGAACACGCGCGCGCCCTCCGCCGCCGACGCGACGGCGCGCCC 

25 

GAGCTCCACCTCCTCGGGGGCGAGGCGCGCCAGCACCGGCGCCTCGGCCC 
15 GCGCTCCGCCGCTCACGATGAGCGTGGCCTCGGCGAGCCCGTAGCAGGGA 
TAGAACGCCTCTCGCCGGAACCCGCTGACCGCGAAGGCGCGCGCGAAGCG 
30 ATCGAGCGTGTCGGCGCGCACCGGCTCGGCGCCCGTGAACGCGACCTCCC 

ACGACCGCAGATCGAGCGCCGCTCGCTCCTCCTCCGAGCTCTTCCGGACG 
CACAGGTCGTATGCGAAGTTCGGGCCGCCGCTCACCGAGGCGCCGAGCGC 
2 0 CGAGACGGCGCGGAGCCACCGCATC3GCCTCTGCAGGAACGAGAGCGGCG 
35 ACATGAGCGCGACGCGGATCCGCCGGTAGAGCGCCTGCAAGATCCCGCCG 

ATGAGCCCCATGTCGTGATACGGCGGCAGCCAGATCACCCCGACCGGATC 
CGGGCTCGTCAGGTCGAATCCATGCGCGATGAGCCGCGAGTTGTGCAGCA 
GATTCCCGTGGGTGAGCATCACCCCCTTGGGCTCGCCGGTCGAGCCGGAG 

40 

2 5 GTGTATTGAAGGAACGCGACCGACTCCGGCCGGAGCGCCGCGCCCGGCCC 

CTCGATCGGGCCCGGCGACGGGCCGTCGGTCGCGATCCACCGGAGCCGCT 
GCAGCGCGGCGGCCGCGGCGCTGGCCGGCAGGGACGCCACGATGCCGGCG 
45 ACGGCCGATGACGTGAGCGCCGCCTCGGCGCGCGCGTCCGCGACGATGGA 

AGCGACGCGCGGCAGCGTCCGCTCGAGCCGGCCGAGATCCGGCGGATAGG 

3 0 " CGGGCACGGTCCGGACTCCAGCGTAAAGACACCCGAAGAACGCGGTGATG 
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TACTCGATCCCCGGCGGATACAGCAGCAGCGCGCGGGCCCCGGGGGCGAC 
GCCCGATGCCTGCAAGAGGGCCGCGACGGTTCGCGCGCGCTCGTCAATTT 

10 

CCCGCAGGGTCACCCAGGTCGCCCCGGCCTCGACGTCGCCGGACTCAAGA 
AAGCAATAGATTGGGCGGGCGGGCTCAGCTTCGGCCCGCTGGCGCAAGAG 
5 GTCGATAACGGTGGAAGGGCGGTTCCGTTCGTTCCGTTCCAATGCAAGAA 

t5 AAGCATCATTCATTGAACAGACCCCTCCGCCGCGGAGATAGCAGCTTGTC 

CGCTGCGACACAACCGCCGCGCGACGCGCGTGGCACGGCGGGATCCGGGC 
GTTACTCCACCTGCACTTCCCGTCGCGTCACGCTCGCTCCGCCGCGGGTG 
TCGTGAACCACCGCCCACAGCGACACGCGCCCTGGCTCCGAGGGCGGCGT 

20 10 CCACGTGGTCCCGTTGCCCCCGCGGGAGGCGCCGGTCGTATCGCTCACCA . 

GGCGGCGCGCCCCGTCGAACTCGCCACCGTCCGTGTAATAGTCGACCCAG 
ATCGCCTCGCGCGCCGGTGGACCGCCGAGCCCGGCGGCTTCCTCGTCCAC 
CTCGGCGGCCTTCTCGGGCACGACAGCCTCAATCTCATAGGTCGTGCACT 

25 

CGTCCTCGGCCGGCTCGGTCCGGCCGCAGCCTTGGGCCTGCTCCTCGGAC 
1 5 CGAACGCACCGCTTCACGACGGGCAAACCGTCCTCGCCCGGCGCGACCTC 
ATTGCCATCGAGCTTCAGCGTGAAGCCGTCGATGGGCGGGTTCGTGTTCA 
30 GCCGCTCCTTCTTGAAGACATAGACCTGCGTGTAGCCCACGACGAAGCTG 

TCCGGACCGAGCACCGTCCCGTCGTCGCCGACGCACTCCAGCGGAAACCC 
GGCCGTTTCGGGCGCCGAAGCCACGCGTGTCGTGCCGGCGCACACGGCGA 
20 ACAGCACGTAAGCCGACGAGTACACCGTCCCCGTCTCGGTGGGCCTCGCG 
35 TCCTTGAGGATCTCCTTGGGCAGCTTCCACCCGAACGAGACCGCATCGGG 

CTCGCCGCTCTTCTCCGGACCGATCTCCTGCTGCGCGAAGGGGACGGTGC 
GCTCCCCGTCGCCGCTGCCGCTGCCGCCGTCGCCGCTGCTGCCGCCGTCG 
CCGCTGGCACCGCCACCGCCGCCGCCGCCAGCGCCACCATTGCCGCCTTC 

40 

25 GCCGCCTCCAGCGCCACCACTGCCGCCGTCGCCGCCGCTGCCACCATCGC 
CACCACTGCCGCCGCCGCCGCTTCCGCCGCTGCCGGCCGGCACCGCCTCC 
CGGATTCGCGACGATTCCCACCGCATAGGTGCCCAGCCACTGCGGGATGC 
45 ACCCGAGGTGCTCGTCCACCCCGACCGGCGGATTCACGCAGCCGCCCACC 

CACGTGACCTCGACCTTCCGCGGCGCGCCGCCCTCCGCGCCTTTCGCGTC 
30 GGCGTACGTCATCCGGAACGTCACGAGCTCTTCCGCCGCCGCGTACGGCT 
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TGTCCGCCGTCACGGCGAGGACGCGGAGCCCCTTCACCTCGGACGAAGGG 
GCCATGTCGCTCCCGGCGCAGCAAGGGATGCCCACGGCCAGGGTCGAGAG 
CGCGGCCAGCAGCGCGCGACGAGCGGGCAGTGCGGTCCGTTTCATCAGAA 
ATCTCCTCGCAGCCCGAGCGTGGGCAGGAAGGGGAGCCCCGTCACGTACT 
5 CCCGCTTCGTGTAGTTGAAGTTGTAGCTGATGCCCTCCGCAGCCATGTAA 
15 TTGTAGACGTTCTGGATATCGAGGTAGAGCCCGAGCTGCCACCTCTTGAA 

TTTCCACGTCTTGTCGGCGCGGATGTCGAGCTGGTGAAACAGCGGCATCC 
GCTCGCTGTAGTCACCCCCGAGCGGGATCGGCGAATACCTCGCCGAGGAC 
GCGTQGTAGATCGCGTTCACCCGGTTCGGATTGCACCCCTTCTCCTCCGG 
20 10 ATCGCAGACATAGGGCGTCTGCAGGTTGCCCGACACGAGCCGGAAGCGCG 

CGCCCAGCTCCCAGCCCCGGCCGAGCCGCAGGCTCCCGAGCACCGTCAGC 
ACGTGCGTCTGATCGAACTGGGTGAGGTGCTCCTCCTCGTCGGGGCCGTC 
CTTGCGCACCGACCGCGAGAGGGTGTACGCCGCCCAGCCGAAGAAGCGCT 

25 

CGTCCGGCTTGTACTTCAACAAGAGCTCGCCGCCGACCGCGTATCCGGTG 
15 CCATCGTTGGCATAGTCGTCCTTCTCCGGCGAGAAGACGACCAGCCGATC 
GAGCTGCTTGTAGAACCCGTCCAGCGTCACCTCGATCTGCGGCGTGATCT 
30 CCTGCTCCACGCCGAGGCCGTAATGCACGGCGCGGTTCGACTTGAGCTCC 

GCATTGCCGAACGGCTCGATGCTCTCCGCGAACTGCGGCGCCTGATAATA 
AAGGCCCACGCCCCCCTTGGCCGTCGTCCGCGGGAAGCCGCTCCGGATGT 
20 CGTAGCGCGCGTTGACCCGCGGGCTCACGTCGAGCGTCTGCGTATCGAGC 
35 . GCGTAGTCGACCCGCACCCCGGGGACGATCCGCGCCCGCGGCGAGGGGAC 

GACCTCGAGCTCGGCATACGCCGCGGGCCGCGAGTACGCGCCGTCGAACG 
ACCGATCCTGGAACGGGTACGTCGAGAACGGCTGGTTCGACGGGTGGCCC 
GCGGGCTGCTGCGACGGCGCGCGGATGTTGACCGTGGCGACGCCGCCCGA 
GAGGTCGGTGCCGACGTTCATCGTGAGGTACCGCGCGAACCTGTGCGAGA 
GCTCCAGCCGCAGGTCGAGCGAGGTCGAGACGACGTTGAAGGCGAGGGGA 
GAGATCTCGAAGTCGGCGATGTCCCGGCCGAGCGCCATCGACCACAGCAG 
45 CCGATCCCGGCTCCCGATCCGGTTCTCGTAGCTGAGCTGGAAGCGCTGGA 

AGGCGGTGTGCAGCCCGAAATCGCCCGTCAGCGCCGGCTCGTCCTCCGGC 
3 0 GGCTTGTCCAGGGTGATCTTGAAGGCGTCGTCCGATCCGTAGAAGCTCGC 
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10 

GGCCCGAGCCACGCGTCGACCCAGCTGCGGCGGCCCGCCGCGATGAACGT 
CCAGTCCTTGAGGAACGGGACGGGGCCCTCGAGGAGCACGGGCCCGTCGA 
5 TGAGGTCGAGCTGGACCACGCCGTGGTACTTGCCGTCCTGCTTCGGCGAG 

15 CGGAGCCCGACGTCGACGATGCCGCCCATGGCGCGGCCGTACACGGCGCT 

GAAGTTGCCCGGATAGAAGTCGATCTTCTCGAGCATCTCGGTCGGCACGA 
CCGAGGAGAGGCCGCCGAAGTGGTAGATGATCGGCACCGGGGTGCGATCG 
ACGAACGTGAGCGTGTCCTGGGGCGCGGACCCGCGCACGATGAGCAGCCC 

20 10 GAAGCCGCTGCGCGCGACGCCCGGCAGGCTCTGCAGCGACCGCAGCGCGT 

CGCCGCCGGTGCCGGGGATGCGGTCGATCTCGCGGCGCTCGATCGTCCTC 
CGCGTCACCTCGCGCGGCGGGCGCTCGCCCTGCACGGTCACCTCGATGCC 
CGGCGCCTTGCCGTCCTGCGGCGCGGCGAGCGAGATGCGGTAGCGCACCT 

25 

CGATCGCCTCGCCGGCCGCGATCTCCTCCTCGGCGGCGAACGGCTCGAAC 
1 5 CCCGCGGCGGCGACCTCGACGCGGTACTTGCCGGGGGGGAGATTCTTGAA 
GCGGAACTTGCCGCCCTGGTCCGTCTTCGCCTCCTCGCGGCCGCCGTCGG 

30 GGCGCACGAGGGTGACCGCGATGTCCGGGAGCGGCTCGCCGGTGCCCGCG 

GACAGGACGGTCCCGACCACCGTCTCGACGTCGGCGGGCGGCGCCGGCGC 
GGCCGCATCGGCGGGCTTGGGCGTGAGCGTGAACGCGTACCGGTAGAGGA 
20 TGCGCGCCGCCGCGGGCGTGCCGTCCGGGCGCCGCGCCGGCGCGAACTCC 

35 AGGCCGGGCGCGGCCCGCGAGCGCCGCCTCGTTGAAGCCGTGCCCGCCGG 

GCGTCGCGACCTCGGCCTTGGTGACGCGCCCGGTCTTGTCGATGTCGAGC 
TTGAGGATGACGCTGCCCTCGACGCCGGCGCGCTGGGCCTCGATCGGATA 
CGCGGGCGGGGAGTACTTGATCAGCGTCGGCGGGCTGATGGCGGCGGGCG 

40 

25 CCGGCGGGGGCGCGCCCGGCTGAGGGACGACGACCGCGCCGGCGCCGCCG 
. CGGGGGACCGAGGCGCCGCCCGAGTCGCCCTCGGCGGCAGGAGGAGGCTC 
GGGCGGGGGCGCGCCGGCGGGCTGCGCGCGCGCTGCGCTGCCGGTCATCG 
45 CGACCGCGAGCAGCAGCGCTTCCGAGACGACGAGGCGCATCACGGAGGAC 

GCTGTGGAAGGCATGCGGCCCGCCCTCTCGCATGGCGAGGCCGAGGCGGA 
30 AAGACGCATCGCGCAGCCAGGACCGTGCTTCACATTGCTTCACACAACGG 
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GCGCCGCGCGCGCTCCCGGGCCGCGCGAGCGCAGGCGGCGCGCGCGCCCG 
CGGGCGGCGCGATCGCGAGCGGCGCGCGGTGCGATCAGCCGCCGACCTCQ 
GCCACGAACCGGCTCACGTCGTCGCTGTCGCCCACGAGCAGCAGCGTGTC 
GCCGTCGCGGATCACGTAGTCCGGTGTGGGCGCCTCGAGCCGCGGCTTGT 
5 CGCCGGGCCGCTTGTTCGTGTGCGGCCGCACACCGAGCACGTTGATGCGG 

15 TACCGCTGGCGGATCTTCGAGCCGGCCAGCGTCTGCCCGACCAGCGGCCC 

GTGGGCGTTCCAGGGGACCACGCGGTAGTGGCTCGCGAGGTCGAGGAGGT 
CCTGCGCGAGCGGCATGGTGATGTCGGCGCCGACGCGGCGGCCCATCTCG 
GTCTCGAGCTGGATGACGCGGGTCGCGCCCACCGCGCGCAGGATGTCGGC 

20 10 CTGGCGATCGGTGGCGGCGCGCGCGATGATCTCGCGCACGCCCATCCGGA 

CGAGGGAGGCCACGCAGAGCACGGACGGCTCGAAGTGCTCGCCGAAGGTC 
ACGATCGCGGTCTCCACGTACTGCGCGCCGATCCCCTCGAGCACCTTGTG 
GACGGTGGCGTCGCCGACGAACGCGGCCGAGGTCTTGTCCTTCACGGCGT 

25 

CGACGGCCTCCGGGTTGTTGTCGACCGCGATCACCTCGGCCCGGTTCTTC 
1 5 CAGAGGGTCTCGACGACCGACGTGC CGAACCGCCCGAGCCCCGATGACGA 

GGACGCTCTTCGATTTCATGGTCTCCGGTCGCGCGCCGCCCCCTCGGGGC 
30 GCGGCGCCGCGAAGGTCTCACGGATGCGCCGGGAGCGCCACGGTTCGGCG 

CTGCCGTCGTCGCGACGGCGCGGCCCGCGCGGGCCGCGCCGCGCGGCGCT 

CAGTAGAGCTCGTCCTGGTGCTGCCACCGCTCCGAGATCCACGCCTTCAG 
2 0 GTACGCGATCTCCTCCTCGTACGTCGTGAAGTCGTCGCGCCAGCTCCAGC 

35 

CCTCGTAGCTCCGGTACGCCTCGCCCCACCGCGCCTCGTCCCGGCGCGCG 
CTCGCGTCGATGCGCTCCACGTAGCCGTCCACGATCGCGTGGATCTCGGC 
CTCGGCGAGCGCGCCGCGCAGGACCTGATCGTAGCGGGCGCGCAGCGGGT 
CGCCGATCGACGGCTCCTCGAGGAGGCGCTCGAAGAGGAGGTTCACGTCG 

40 

2 5 CGGTAGTCGACGCGATCCGACGCCGGCTCGCGCTCGGTCTCCCACGACTG 
GCCGAAGCTCGCGTTGAAGTCCCACGGCGCGTAGCGGAATACGCCGTCCG 
CGGCCGGATCGCGGTAGTGGTAGCTGTTCTTTCCGGCCGAGTCGTTGGCC 
45 ACGATGAACGTGACGAAGATCCACCAGTCCTCGTAGTCGCGCAGATCGAT 

CCGCGACCCGATCTCGGCGGCGAACGTGGCGTCGTCGGACTCGGCCACGA 
30 AGCTCACGAGATCTTCCAGATCCGAGAACGCCTCCGGCTCGCCCTCGGCC 

50 



55 



WO 00/22139 PCT/US99/23535 

192 



GGCGCCCCTTCCTTCTTCTCGAAGCCGTCGTGCAGCGTGTCCTTGGGGTC 
GCCGGACCGGTCGGTCAGCGCGAAGTTCGCGTCGTGGCTGACCGCCTTGT 

10 

AGAGGTTGCCGTCCTGCGGGTAGCCGTGGTCCTCCATCAGGTAGCCGTCG 
ACGTGATCCGCGACGGTGTAGAGCCCCGCGTACTCCCCGTCGAGGTACAG 
5 GACGGCGCTGTAGGTCTTGATCTGGATGTGCTCGGGATCGAGGCGGTTCC 

75 AGAGGTCATAGGCGAGGCGCTGCCGGACATAGGAGTTGTCGTCGAACGTC 

GTGATGAGCACGACCTTGCGGCGATCGGTGAAGCCGCCCGCCTCGTCGGG 
CTCGTTGAACTTGTCGTCCTTGGGGAACTTGAGGGTGTAGCTCCGCTTCG 
GGTACGAGAGCGAGCTCTCGCCGCGGAGCTCCGCCTCCGCGGCGTACGTG 

20 10 TGGCCGCGGTAGATCACCGTGGCCGGGGCGTACTCCTTGTCCTCGGGGAC 

GGGCGAGAGGAAGAGCACCGGCAGGCCGTACTCCTCGGGGTAGCGGGTCG 
GATCGACGACGGGCACGTTCGACGGATCGGCGAAGGCGTCGGCGACGCCG 
ACCTTGACGCGCCCGACCTCGGACGTCTGCGCGACGCGGATCTCGATGTC 

25 

GTAGACGGCGGCCTGATCGAGCCCGGGCGAGAACGTCACCTCGCGCGCGA 

1 5 TCGGGTCGTACGCGGCGCCCTCGGGGAGCGGGCCGACCTCGAACGCGTCG 

CCGGCGAGCGCGAGGCCGCTCGCGCACGTCACCGGGAACGTCACGGTCTC 
30 CCCCTCGAGGAGCCAGTGCGGGCCGCCGCCCGACGGCTGGCAGCGCGAGC 

CCTCGGCGCTGGAGCCCCCGCCGCTGGAGCCGGAC 

20 Seq ID No 90 (>Contigll) 
35 GGCGACCCCACATATCACATAGTAGAATCAGTGTGAGTTAGACAAATGTC 

GAGTGATGAGAAGGACAGAAGTGAGAACTCTGTCGATCACTGTAGAACGA 
GAGAGTATGAGCCTGCATACATGATAGCGGACATGAGAACGAGTGTANTA 
TGATGCTACTAAGAGAGTAACAGATCAGAGACTAGAGTAGAGCAATAGAA 

40 

2 5 NTCAGAGATAAGTCAATGACGAGGAGTAGTGATAGAGCTCTTAATAATGG 

CTGAGGTCGAAGATAGAAGTGCATAGAGCGATAGATATACAATCGGTTGA 
AGCAGAGAGTAAGATAAGATCAGACACNGAGTACAGAGAGAGACGAATAG 
45 ATGGCGTGATNTCACAGAGAGGTGCGAGCGT AG CTGACGAGAG CAGAGAC 

C-CAGAGTAAGTCACACCTAGATAGTTACGGCGAGAGACAAATGATAGGAA 

3 0 GGAGTGGACGAGATCAACAGNCCGGAGCACAAGAACGTGAGATGCGACCG 
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TGTAATAAACAGGAGACAAGAGCGACTACATAAGAGAGCGAAGCGAATAG 
ATAAGATATAAGCCCAGAGCAAAATAGAAGGAGAGAGAGAGTATTTGTAA 
TAAAGCAACAAGACGGAGAGAGCGAAGCAGCAGGCAACGATTAGAAGAAA 
GACGACAGGAAAGTGAAAGCGAAAGAGAGCAGGTAGAAAGAGAACCAAAA 
5 AAGCACGAAGGAAAAGGAAGCTTCTATGATAGGTGCGGGACAAGGCGTAG 
15 CTACAGGAGACAGCCGGCATACGAGGAGCCGGTAAAAGCTAGCCTTTCAG 

AACACATCGGGAGCGCGTAAAGGCGGACCACGCTCGACGGGATCATGTAC 
GCCGACAGCGACGCCTTCAGCCCCGCGCGCACGTCCGGCGCGTCGCCCGC 
CGCGTCGCCGTCGAGCACGACGTAGGCGACCAGGCGGGCGTCGCCCGGCG 
20 10 CGTCCTCGCGCAGGACCACGGCCGCCTGGCCCACGCCGGGCACGCGCCGG 

ATCTGCGCCTCGACGTCGCCGAGCTCGATCCGGTGCCCCCGGAGCTTGAT 
CTGGTGGTCCGAGCGGCCCTGGAACTCGAGCATCCCGTCGGGCAAGAAGC 
GCGCGACGTCGCCGGTCCGGTACATCCGCCCGCCCGCGGCGCGCGCGCAC 

25 

GGGTCGGGCAGGAAGCGCTCCGCGGTGAGCCCGGGCTGCCCCACGTAGCC 
1 5 GCGCGCGAGCGGCGCGCCCGCGATGTAAAGATCGCCGAGCGCGCCGATGG 
CGGGGCGGCGCATCGCGCCGTCGAGCACGAACACCTCGGCGTTCGCGACC 

30 GGCGCGCCGAGGGGGACCCACGTGACCCGCGGGTCGCTCGGCAGGACGCA 

GCCGGTCACCGCGATCGCGGCCTCGCTCGGCCCGTACATGTTGATGAGGT 
CGCCGTCGTGCTTCGCGTAGAAGCGCCGGACGAGATCGAGCGGCACCGCC 
20 TCGCCGCCCACGAGGACCTTCCGCAGGCTCGCGGGGAACGGCTGCTCGGG 

35 CCCCCCGAGGAACGCCGCGAGCATCGAGGAGACGAAGTACGCGGTCGTCG 

CCCCCTCGTCGCGCACGAGGCGCCGAAGGTACTCGGGATCGCGGTGCCCG 
CCGGCCCGGGCGACGACGATCCGCGCGCCGAACGAGAGGGGCCAGAAGAT 
CTCCCAGACGGAGACGTCGAAGCCGAACGCGGCCTTGAGCAGGACCCGGT 

40 

2 S CGTCCGCGGTGAGCGCCCAGTACCGCTGGATCCACTGCATCTGGTTGACG 

ATGGCGCGGTGGGAGATGAGGCTCCCCTTCGGCGTGCCCGTCGATCCGGA 
CGTGTAGATGACGTACGCGCCGCTGTCCGGCGGCGGGCTCACGGCGGGCC 
45 GCGCGTCGGAGCACGCGGCGATCTCGGCGGCCTCGGCGTCGAGGAGCAGC 

GTCGTCCAGCCGCCGGTCGGGAGCTCGTCGGCGATCGCGTCGTGCGTGAC 

3 0 GAGGAGGCGCGCCCGCGCGTCCCGCATCATGAAGGCGAGGCGC7CGCCGG 
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GGTACTCGTGGTCGAGCGGCAGGTAGGCGCCGCCCACCTTGAGCACCGCC 
AGGGTCGCGACGACCATGTCCTCGGAGCGCGGCACGCAGACCCCGACGAT 
CGTGTCGAGCCCGACGCCGCGGCGGCGCAGGCAGCTCGCGAGCCGGTTCG 
CGCGCCGCTCGAGCTCGCCGTACGTGAGCGACTTGCCCTCGCTGCGCACC 
5 GCGACGACGTCGGGGTGCTGCTCGGCGCGCTCCTCGAACCACCGGTGCAG 
15 CGCGCAGGCCGACGGCAGCTCCATCGCGGGGCCGCGCGACCACGCCTCGA 

TCTCGGCGCGCTCGCCGGGGCCGACGTACTCGCCCTGGGCGACCGGACGC 
TCGGGGTGGCGCGACAGGTCCTCGAGCAGCGCCGCGAGGCGCTCGGCGAG 
GCGCTCGGCGTCGCGGCGCGCGCCGGCCGACGCGTCGTAGCGGAGCTCCA 
20 10 GCGACGCCGACX3GGCCCGCGCCGGCGCAGTGCAGCCGGCAGGCGACCTGG 

TCAGACGTGCTCCAGACGTCGAGCACGCGGGCCCGCGCGCCGTCGAGCGA 
CAGCGCCGGCGCGGCCTCCGCGGCGGAGAAGCCCCCAGCTCATCCGGTGG 
CTCACCCCGGGCGCGGCGTCCTGGTGCGCGGCCGCCTCGGCCTCGGCGAG 

25 

CGCGAGCCGCCGCGCGACGTCGGCGAGCGTGTCCGAGGCCGAGATCTCGA 
15 TCCGCACCGGCAGGAACCGCGCGAACGGCCCCACCGCGCCCGCGAGCGCG 
TCCAGCGACCGCCCGTCGAAGCGGACGGCCACGGTGACCTCGGGCTCGTT 

30 GCCGCCGCTCATCCGCCACAGGAGCGACGCCCACAGGGCCAGGAGCACGA 

TCCGCTGCGGGACCTGCCACGACGACGACCAGCGCTCGACCTGCGCCATC 
CCGCCTTGTCCCAGATCGACCCGCGCGCGCCCCGAGCCGGCGCCGGCGCC 
2 0 GGCGCCGCCGCGGCTGAAGGCGAGGTGGAGCGGGGGCCCGAAATGCGAGC 

35 GGCGCTCGGCCCAGAACCTGCGCCCGTCGCCGGCGTCCTCCGACTCGAGC 

ATCCCGTTGAGCCACTCGGCGACGTCCGCGTACTGCTGCTCGGGCGGCGC 
GCCCGCGCCCGCGGTCGACGCGCAGAGCTCGCGGACGAGCGGGGCGATCG 
ACTCCTCGTCGACGCACCACGCGGGCGCCGCGAGCACGAGCCGGCGCTCC 

40 

2 5 TCCGGGCCGACGCGGACCAGGCCGACGCGCAGCCCGTCGTCCGCGCCGCG 
GTCCTCCGAGAGGCGCGCGACGAGCCGCGACATCCGCTCGCCCTGCTCGG 
CTTCGGAGCACCCGACCCAGTCGTCCTGCTGACGCCCACGCGAAGCGCGG 
45 CTCGCCGACCACCTGCGCGGCCTCGCCCGCCCCGCCTCGACGAGGCGCGT 

GCGCAAGATCTCGTGCCGCTCGGCCAGCGCGAGCGCCGCCGCCGAGAGCC 
30 GTCCCTCGTCGCACGGGCCGGTCACGGCGACGACGGCCAGCGTCCGGCAC 
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CCGGGCGCCCCCGCCTCCCGGTCGAGCGCGCGGATCGCCCGCTGCTGCGG 
CGAGAGGCTGAAGCCGGTCATCTCGTGGTCACTCATCCAGGTCGTCCTTC 

10 

GGTGAGGTCTTCGCTTCGCCCGGCGCGCCCGGGCGCGGGAGGGTCACGGC 
GCGCCGGCGCGCGGAGGTCAGCTTGTCGAGCGCGGCGCCGCGCGCGGCCT 
5 TCCGCTCGAGCTCCCGGCGGGCCGCCGCGGCGCGCTCGAGCTCCCCCCGG 

15 AGCTCCGAGACCGGGGTGTCCGGCCGCGCCGTCGCAGTGGCCAGGATCTG 

CCGGTAATCGCTAAGGAAATTGTCGACCGTCGCCGCCCGGTACAGCTCGC 
TGCTGTGCTCGACGCCGAAGCGGAACGAGCCGCCGGCCTCGGCGACCGTG 
AGGACGAAGTCGAACGCCGTCGTGGTCGCCTCGCCCTCCAGCGCCTCGAG 

20 1 0 CTCGAGCCCCTCGAGCTTCATCGGGGGGACGTGCACGTTGCGCATGACGA 

ACTTCGCGTCGAAGAGGGGCACGTGCCCGACGGCCCCCTTCGGCCGCAGG 
GCCTCGACGAGCCGGTCGAACGGCAGGTCCTGGTGCTCGAACGCCTCGAG 
CGCGACGTCGCGCACGCGGCGGACCAGCGCGCCGAACGTCGGGTCGCCCC 

25 

CGCAGTCGGTCCGGAGCACGAGCTGGTTGACGAAGAAGCCGATCATCGGC 
1 5 TCGGTCTCGACGCGGTTCCGGTTCGCGACGTCGGTGCCCACGACGAGGTC " 
CTCGAGCCCGGTGCGCTGGTGCAGGACGAGCTTGTACGCGGCGAGCAGGG 

30 CCATGAAGGGGGAGATCGCCTCCCGCTCGCAGAACGCCTTGATCTGGCGG 

GTGAGCTCGGCCCCGGCGTCGAGGCTCCGCCGCGCCCCGCGCCACGTCCT 
TCGCCCCGCCGGCTCGTGGTCGACCGGCACGCGGGCCCGGCGCAGCGCGC 
2 0 CCGAGAGCTTCGTCGTCCAGTACCGGAGCTCGCCCTCCAGGACCTCGCCG 

35 GACAGCCACGCCCGCTGGGCTGCGGCGAAGTCGACGTACTGCGCCGGGAG 

CTCCGGCAGCCGGGAAGGCTGGCCCTGCGCGAAGCCGCCGTAGAGCGCGG 
CGAGCTCGCCGACGAAGACGCCGACCGACCAGACGTCGAACACGACGTGG 
TGCACGACGAGCGCGATGACGTGCTCGTCGTGGCGCTTC CGG ATGACCCG 

40 

2 5 CACGCGGAGGAGCGGCCCGCGGCTCAGGTCGAACGGCGCGAGGCTCTCCT 

CGAGGACGAGCGCCGAGACCGCCGCGTCGAGGGCCTCGCCCGCGAGGTGC 
TCGAGGTCGGACATCCGGAACGGCACCCGGGCCTCGGGCGCGACGACCGG 
45 GAACGGCACGCCGTCCCTGGCGCTGAACGTCGTCCGCAGCGCCTCGTGGC 

GCCGCGCGATCTCGAACAGGCTGCGGCGGAGCGCGTCGACGTCGAGCCGG 

3 0 CCCGTCGCGCGCACCACGAACGGGATGTTGTACGCCGGGCTGCCCGGCTC 
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GAGCTGATCGACGAACCACAGCCGGTGCTGCGCGAACGACAGCGGGAGCG 
GGCCGTCGCGGGGGATCCGCGCGATCGGGGGGAACTCGCGCCGCCGCGCC 

10 

TCGCCGCGGCGCGCGGCGTCGACCTGGGCCGCGAGCGCCGCGACCGTCGG 
CCCCTGGAAGAGCGCCCGCAGCGGGAGCTCGACGCCGAGCTGCGCGCGGA 
5 TCCGGGACATCACCTGGGTCGCGACGAGCGAGTCGCCGTGCAGGCCGAAG 

15 AAGTCGTCGTGGACGCCGATCTCGTGGACGCCGAGGAGGGCGCTCCCAGA 

TCGCGGGCGATCGCGCGCTCGGACTCGGTCGACGGCGCGGCGAACGCGGC 
CCCGGCGTGCGCGCGGGAGACCGCCGACGTCGGGAGGGCGTTCGGCGCGG 
GCGCGAGGGGCGCGTCGGCCGGCGCCGGCTCGGCGGCGGGGGCGCCGCGG 

20 10 CGCGGCGCGATCCAGTGCCGCGCCCGCTCGAACGGGTACGTCGGCAAGCG 

GACGAGCGCGCCGGGGGACGCCCCCGCGGACGGGCCGTCCAGTCGACGGC 
GTGGCCCGCCTCCCAGAGCTGGCCGAGGGCCTCGGCCAGGCTCGCGGGCT 
CGGACGCGGCGTGGGTCGACCCGAGGCTCGCGATCGCGGCGCCGCCGCGC 

25 

CCGGCCAGCGTCTGCCGCACCAGCGTGGTCAGCCCGCGGCCGGGGCCGAC 
1 5 CTCGAGGAACAGGGCGTGCCCGGACGCGAAGAGCGCCTCGACGCCGTCGC 
TGAAGCGGACCGGCTGGCGGAGGTGCCGCGCCCAGTAGGCCGGATCGGTC 
30 GCCTCGGCGTCGGTGAGGAGGGCGCCGGTGACGTTCGAGACCACGGGGAT 

CTCCGGCGGGGAGAGCCGCGCGCGCCGCACGCTCTCGAGGAACGGGGCCA 
CCGCGCCGTCGATGAGCGCGCAGTGGAACGCGTGGGACGTCTGCAGCGGC 
2 0 CGGGCGAACACCTCGCGCGCCTCGAGGCGCGCGGCGAGATCGCGGATCGC 
35 GCTCGCCGGGCCCGCGACAACCGTGAGCTTCGGGCTGTTGACCGCGGCGA 

TCTCCAGGCCGGCCTCGAGGAGGCCCTCGACGTCCGCGGCCGGCAGGCCG 
ACGGCCAGCATGCTCCCGGCCGGCGCCGCCTGCATGAAGCGCCCCCGATC 
GATGACCAGGGACATCGCGTCCTCGAGCGTGAACACGCCCGCGACGCAGG 

40 

2 5 CCGCCACGAGCTCGCCGAGGCTGTGGCCGATCATCGCCGCGGGCTCGATC 

CCCCAGCTCATCCAGAGCCTGGCGAGCGCGAGCTCGACGGCGAAGAGCGC 
GGGCTGCGCCAGCGCGGTGCCGAGCAGCGTGCGCCCGTCGCCCTCGCCCT 
45 CGCGGAAGACGACCTCGCCGAGATCGAGGCCGCGCGCCCGCGCCGCCGCC 

GCGCACGCGTTGAAGGCGCTCCGGAACGCCGCCTCCTGCGCGTAGAGCGC 

3 0 GCGGGCCATCCCGACGGCCTGCGCGCCCTGGCCCGGGAACGCGAAGACGG 
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GCGCGGCTCATCGGGGCGCGCGAGCGCGCTCGCCCCCTCGCGGGCGAGCC 
CCTGGATCGCCTCGGCGCGCGTCCGGGCGACGACCGCCCGGCGGTACGGG 
TGCTCCGCGCGCCCGGTCTGGAGGGTGAACGCGACGTCGTCGAGCGGGAC 
GTCGGTCGCCTCGAGGTGCGCGGCGAGCTGCGCGCAGGCCGTCGACAGCG 
CCTCCGGCGTGCGCGCCGAGAGCGTCAGCACGTGATCGCGCTCCGGGGCC 
GGGGCGCGGGGCGGCAGCGGGGGCGGCTCCTCGAGCACGACGTGCGCGTT 
CGTCCCGCCGATCCCGAACGAGCTCACGCCCGCGCGGCGCGGGCGGAGCT 
CGCGGGGCCAGGGCGCCGCCTCCCGCGGGACGAAGAACGGGCTCGCCGCG 
AGGTCGAGCTTGGGGTTCGGCGCCTCGAAATGGACGCAGGGCGGGATCTC 
20 T c GCCGCTCCGCACGACGTGCGCCGCCTTGATGAGGCCCGCGACGCCCGCCG 

CGGCGTCGAGGTGGCCGATGTTCGCCTTGATCGAGCCGAGCGCGCAGTAC 
GCCTTCCTCGGGGTCTTGCGGCGGAAGGCCTGCGTGAGCGCCTCGACCTC 
GATCGGATCGCCGATCGCGGTCGCGGTGCCGTGGGCCTCGACGTAGCCGA 
TCGAGCCGGGATCGACGCCGGCGACCGACTGCGCCTCGGAGATCGCCGCC 
GCCTGGCCGTCGACGCTGGGCGCCATGAAGCCGACCTTGCGCCCGCCGTC 
GTTGTTGACGGCGGAGCCCCGGATCACCGCGTGGACCGTGTTTCGGTCGC 
GGAGGGCGTCCGCGAGGCGCTTCAGCGCGACGATGCCGACGCCGCTGCCG 
CCCACGGTCCCCTCGGCGCGCGCGTCGAACGGCCGGCAGCGGCCGTCGGG 
GGAGCAGATGCTGCCGGGCACGTACGGATACCCGCGCTTCTGCGGGATGC 
CGATGGAGACGCCGCCCGCCAGCGCGAGATCGCACTGGCCGCCGAGGAGG 
35 CTCTCGCACGCCATGTGGACGGCCACCAGCGACGTCGAGCACGCGGTCTG 

CACGACCACGCTCGGCCCGTGGAGGTCGAGTTTGTACGAGACCCGCGTCG 
CGAGGTAATCCTTCTCGCTCGCCAGCATGAGCGCGTGCGGATCGACGGTG 
GCCGCGAGATCCGGGTGCGAGAGGAGCTGGAGGAGGTACGTGTTGGAGCC 

2 5 GCACCCCCCGAAGACGCCGATCGCGCCCGGGAACCGGGCCGGATCGCAGC 
CGGCGTCCTCCAGGGCGGCGACCGCGCACTCCAGGAAGAGGCGCTGCTGC 
GGGTCCATGAGCTGCGCCTCGCGCGGCGAGTACCCGAAATAGGACGCGTC 

45 GAAGCGGTCGATGTCGTCGAGCAGGCCGCCCGCGCAGACGACGGGCGCCC 

CGGGGGCCGCGCTCGCGCCGACCGGCGGCTCCTCGCGCTCGCTCTCCGGG 

3 0 AAGCGCGCGATCGACTCGACGCCGCGCCGCACGTTCTCCCAGA3GGCGTC 

50 



25 

15 

30 



40 



20 



55 



WO 00/22139 



PCT/US99/23535 



198 

GACGCTCGGGGCGCCGGGGAAGCGGCCCGCCATGCCGACGATCGCGATGT 
CGCTCCCCCCGTCCTCGGTCTCGATCGGCTCTGACATGGCTATCCTCGCC 
CCCGGCGGCGTCGCGCGTCGCGGCGCGCCTCGGCGCGCTGCGCCCCGACG 
TCGGCCGGCTCGGCCTTGACCGTCGCCGCGTCGAGCCGCTGCGCCAGTTG 
5 CTCGATGGTCGGGTACTGGAACAGGTCGGTCAGCGACACGGCCTGCGCCG 
CGGCGCCCTCGTCGGGCGCGCGCGCCGCGATGCGCTCGGCGAGCAGGCGC 
TGCGCGCGCACGAGGAGCAGCGAGGTGAAGCCGAGCTCGAAGAGGTTGTC 
GGTCACGCCGACGGCCTCGACCTGCAAGACCTCCGCGAGCACCGAGGCGA 
TGAGCCGCTCGGTCGCGGTCCGCGGGGCGACGGCCGCGGCGCGCGGCGCG 
10 ACCGCGGCGGGATCCGGCAGGGCGGCGCGGTCCACCTTGCCGTTCGCGCT 
CAGCGGCAGCGCCGGGAGGACGACGACCTCCGCGGGGATCATGTACTCCG 
GCAGCTTCTTCCGGACGAAGTCGCGGAGCGCGGCGCCATCGCCGTCGGCG 
CCGACGACGTACGCGACCAGGCGCTTCTCGCCCGACGGATCGGTCTTCGC 
CGCCACGACCGCCTGCTCGACCGAGGGGTGCTGCGCGAGGGCGGCCTCGA 
15 TCTCGCCGAGCTCGATGCGGAAGCCGCGGATCTTCACCTGATGGTCGGTG 
CGCCCGAGCAGCTCGATGGTCCCGTCGGCGAAGTAGCGGCCCAGGTCGCC 
TGTCCTGTACAGCCGCTCGCCGGTCGTGGGGTGCTTCAGGAACCGCTCCC 
GGGTCCGCGCCTCGTCGCGCCAGTATCCGAGCGCGACGCCGATCCCGCCG 
ATGTGGATCTCGCCGGGGACCCCGATCGGACACGGCTCCAGCCCCTCGTC 
2 0 GAGCACGTAGGTGTGCTGGTTCGCGAGCGGGCGGCCGTAGGGGATGCTGC 
GCCACGCCGGGTCGACGTCCGCGATCGGGTGGGCGATCGACCAGATCGAC 
GCCTCGGTCGCGCCGCCGAGGCTCACGACGCGGGGCGCGCGGCAGGCCGC 
GCGGATGCGATCGGGGAGCTTCAGCGGGATCCAGTCGCCGCTCATCATGA 
CGAGGCGGAGCGACGACAGCGCCGGGTCGCCCGCGCCGGGGGACGCGTCC 
2 5 ATGAGCATCTCCATCAGCGCCGGGACCGAGTTCCACACGGTCACCCGCTC 
GCGCTCCACGAGCTCGCGCCAGTGCCCCGGATCCGAGGCGCGGGTACGGT 
CGGGGATCACGACGGCGCCTCCGGCGGCGAGCGTCCCGAACACGTCGTAG 
ACCGACAGGTCGAAGCTCAGCGACGAGAGCGCGAGCACCCGGTCCTCCGG 
GCCGACGTCGAAGCGGCGGTTGATGTCGAGGACCGTGTTCACCGCGCCGC 
30 GGTGGTCGATCATCACGCCCTTG3GCAGCCCCGTGGACCCGGACGTGTAG 
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ATCACGTAGGCCAGGTCGTCCGTGCTTCCGCCGGGCGGCCGGCGCGCGAC 
GGGCTGCTCGCGCCACCGCTCGTCCGCGTCGACGGCGAGGCGCTCGATGC 
CCGCGGGCCAGGCGATCGTCCCGTCGACCGCCGACTGCGTGAGGACGAGG 
CGGACCTCGGCGTGCTCCAGGAGGTGCCTGAGGCGCTCCTCGGGGAGGCG 
5 AGGGTCCAGGGGCAGGTAGGCGGCGCCGGCGCGCAGCACGCCGAGCACGG 
CGGCCACCTGCTCCCAGCCCTTCTCCATGACCACGGCGACGAGCGCGTTC 
GCGGTCGCTCCGGAGCGCGAGGCCGCCGCGGCGATCGCCTCGGCGCGCCG 
GGCGAGCTCCCCGTAGGTGAGGCGCCGCTCGGCGTCGACGACCGCGCACG 
CGTCGGGCTGCTCGACGGCGCGCTCAAAGAACGGCTCCTCCAGCCGGAGG 
20 io TGATCCGGGGTTGCGACCGCGGTGTCGTTCCACGCGACGAGGGCGCGCTC 

GCGGTCTTCCGGCGCGACGGAGAGCGCGCGGACGCGCTGCGCGGGGTCCT 
GAGTGGCGCGCGAGAGCACGCTCTGCATCGTGGCGAGCATCCGGTCGATG 
GTCGCCGCGTCGAAGAGGTCGACGTTGTACTGGAGCGAGATCACGTCGCG 
CCCGCCGCGCGGCTCGACGCTGAAGCGCAGGTCGAAGCGCGTGGCCTCGA 
15 CCGGGAGATCGAGCGGCTCGATCCGCACCTCGCCGAGCTCGAGCGCCTCG 
GTTGGGGCGTTCTGCACGACGAGCATGACCTGGAACAGCGGCGAGCGGCT 
30 CAGGTCGCGGCGGGGGTTGACCGCCTCGACCACCTTCTCGAACGGGGCGT 

CCTGGTGCTCGAACGCCTCGAGCGCGACCTTCCGCGCCCGCGAGAGGAGC 
TCCTCGAAGGTCGGGTCGCCGCCGAGGTCGAGGCGCATGACGATCGTGTT 
2 0 CACGAAGAAGCCGACGAGGGGC7CGAGCTCGGGGCGAGGCCGGTTGGCGA 
35 CCGCGGTCCCGATGGCGAGGTCGTCCTGGCCCGAGCTGCGCCGGAGGAGC 

ACGCCGAGGGCGGCGAGCAGGACCATGAAGCGGGTGGCGCCGCGGCTCCG 
GGCGAGCTCGTCGAGCTGCGCCACGAGGCGCGCGTCGAGCGGGAGGACCC 
GCTCCGCGCCGCGGAACGTCTGGACGGGCGGCCGCGGTCGATCGGTCTGG 

2 5 AGCTCCAGGACCGGCAGCCCGCGGAGGGTCGCTGTCCAGTGAGCGAGCTT 
GTCGGCGAGCCGCTTCCCCGCGAGGTGGCGGCGCTGCCACACCGCGAAAT 
CGACGTACTGGAGCGGCAGCTCGGGCATGTCCGCGGGCCCGCCGCCCCGC 
GCGCGCCGGTAGAGCTCCGCGAGATCGCGGACGAGGGGTTGGAAGGACCA 
GGCGTCCGTGACGATGTGGTGCGTGGACAGGACCAGGACGCAGACGTCGT 

3 0 GGTCGAGGCGGAACAGCCTGGCGCGGAACACGGGCCCGCGCGCGAGGTCG 



40 



45 



50 



55 



10 



WO 00/22139 PCT/US99/23535 

200 



AACCCCGTGGCCTGCTCGCGCGACGCCCAGGCGCGCGCCGCGGCCTCCGC 
CTCGTCCGGGGGCGTGCCGCGGAGGTCGACCACCTCCGCGGGGGCCGCCT 
CGGGCTCGCAGATCTTCTGCGCCGGCGTGGGGCTCGCGACGAACACCGTG 
CGCAGGCTCCAGTGCCGCCGGACGAGCGCGGCGAGCGCGGAGGACAGCGC 
5 GTCGACGTCGACGAGGTTCCGCAGGCGGACCGCCTGCACCACGTTGTAGG 
15 CGGTCCCGCCGGGGAGCAGCTGCTCGAGGACCCACAGGCGCTCTTGCTCG 

TACGAGAGCGGATACGGCTCGTCCGCCGGCGCGCGGCCCAGCGAGGGCGC 
GATCTCGCTGGCGGGCACCGTCGCTGCGGCGGCGGTGGTCGAGGCGGCGC 
CGGAGGAGAGGCGATCGGCGAGCTGGTGGAGGGTTGGGTGCTCGAAGAGC 
20 10 GTGCGGAGGGTGGTGCGGATGCCGAGGGAGGACTCGATGCGTCCGAGGAC 

CTGCATGGCGAGCAGGGAGTGGCCGCCGAGGTCGAAGAAGCTGTCGTGTC 
GTCCGACGCGGTCGAGGTGGAGGACGGATTGCCAGATGTGCGCGAGCTCC 
CGCTCGAGCTCGCCCGAAGGGGGCTCGTAGTCGGCGTGCGCGGCGGGTGG 

25 

CGCAGGGAGGAGCTTCTTGTCGACCTTGCCCGAGAGGGACATGGGCAAGG 
1 5 CGGGGAGCAGGACGAAGTGGGCGGGCACCAAGGCGTCGGGCACCAGGCGG 
GCCATGCCCTCGCGCAGGTCGCGCTCGGAGGGCGGGTCGGCGCCCGGCAC 

30 GACATAGGCAATCAGGCGCGCGGCGCTGCCTTGGCCGTGGAGGACGACGA 

CGCCCTCGCGGACGGCGGGCAAGCGTCGCAGGGCGGATTCGACCTCGCCG 
AGCTCGACACGGCGACCGCGGAGCTTGACCTGCTCGTCGCGGCGTCCGGC 
2 0 GAAGGCGAGCTGTCCGTCGGGGCGCCAGCGCACCAGGTCGCCGGTGCGGT 

35 AGAGGCGTGCGCCGGGCTGGCCGAAGGGATCGGGCAGGAAGCGCTCTGCG 

GTCAGGTCCGTGCGTGTGTAGCCCTGGGCGAGGCACGCTCCGCCGATGTA 
CAGCTCGCCGAGGACGCCGGGCGGGACGGGCTGCATGTGCGGGTCGAGGA 
CGTAGACGAGGGCGCTGTCGATGGGTCGGCCGAGCGGGGGCTCGTCGCCG 

40 

2 5 AGGTCGGCGACCTCGGCGACGGTGGTGATGACGGTGGCCTCGGTGGGGCC 

. GTACATGTTGAAGAGGCGGAAAGGGAGCGGTCGCCGGAGCGGATGGAGCT 
TGTCGCCGCCGACGGTCATCGCGCGCAGGGCGATGCCGGTCCAGTCTTGC 
45 TCGAAGCACGCCTCGGCCAGGGGCGTGGGCATGAACGAGAGCGTGGCCCG 

CTGAGCGACAAGCCAGGAGACGAGCGCTGTGGGAGAGCGGAGCGCGTCGT 

3 0 CGTCGGCGAGGAGGAGTGCAGCGCCGCAGGCGAGCGGCGTCCAGATCTCG 
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TAGACGGAGGCGTCGAAGCCGCTGGAGGCCAGCTGAGTCCAGCGATCGCG 

GGGTGAGAGCGCGAGCAGGTGCTGGAAGAAGGAGACGAGCCTTGAAAGGC 

TCGCATGGCGCACACAGACGCCCTTCGGCGTGCCGGAAGAGCCGGAGGTG 

AAGAGGACATAGGCCAGGTCGTCGGGCCTGGAGACGAGAGGAATGTGGGT 

GCTGGGCGCGCACGCCCCGTCCTGGACGAGGTGGACGGGGCAGGGGGCGG 

CGGTGAGCTTGTGGCTGGCCTGGCTGCTGGTGAGCACGAGCGCGGCGCGG 

CAGTCGGCGAGCATCTCGGCCAGGCGCGCCGGGGGGTTGGCGGGGTCGAG 

CGAGGCATAGGCGGCGCCTGCCTTGAGGACGGCGAGCTGGGCGGCGACCA 

TGCGGGGCGAGCGCTCGATGCAGACGCCGACGACGCTGCCGGGGCCGACG 

CCGCGGTCGCGCAGCCACAGGGCGAGCTCGGTGGACCAGGTGCTGAGCTC 

TGCGTAGGTGAAGCGCTGGTGTCCGAACTCGAGCGCCGTGGCGTCCGGCT 

GTCGAGCGGCGTGGGCCTCGAAGAGCGCATGGACGCAGGCGGGGGCCGGG 

GCGGAGGCGGCCTGTCGTGCGGCGGCAGCGCCGCTCCAGTCGTCGAGGAG 

CAATGCGCGCTCGGCGTCGGAGAGCATCCGGAGCTCGGAGAGCGGTCGAC 

CGGGGTGCTCGACGGCGCTTTCGAGCAGGAGCACGAAGTGGCGCGCCATC 

CGCTCGATGGTGGCGGGGTCGAAGAGCTGCTGGTCGTACTCGAAGCGCAG 

GGCGATGCCGGAGTCGAGCTCTGCGGCGAACAAGGCGAGATCGAACTCGG 

CCGCTGCCTGCTCGTCGGCGAGCGTGGTGAGCTCGAGCTCTCCCTGCGCG 

ATCCGCACGTCCTCCGCGCCGGTGGTGAGCGCGGCGAGGCGGGGATCCAG 

CGACGGCAGAGCGCCCTGGAAGGCGAAGGCGACGTCGAAGAGCGCGCCGC 

CTCGCCGGGCCGCGCCCCGGGGCTCTGCGAGCAGGTGCTGGAGGGCGCTG 

TCGCCGTGGGCCAGCCCGTCGAGGAACGCGTCGCGCACGCGGGCGACGAG 

CGCGTCGAAGGACGCGGCCCCGCGCAGCGCCACGCGCACGGGGAGCATCT 

GGACGAAATAGCCGAAAGCCCGAGTGCTCTCGTCGTCGTTCCGCCCCGCC 

GAGGGGACGCCCACGACAAGGTCGTTCTGCCCGCTCGCGCGATGGAGCAA 

GACGGTGAGCGCCGACAGCAGGACC'GAGAAGAGCGTGGTCCCGCGCTCGC 

GCGCGAGGCGCGCCAGCGCTCCGGTCAGGGGCTTTGGCAGCGTGATCGCG 

TGAGCGCGACCGCGGCGAGGGCTCGCGTCGTGGCGGGCCCGGTCGCGGGG 

AAGGTCGATGGCGGTCGTCGCGCCGTCGAGCGCCTTGCGCCAGTATTCTG 

CTCCGCCGGCCGCCTCCCGCGGCGAGGGACAGCTCACGCCGGCGGCGAAG 
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AAGCTCGACGGCGGCGGCAGCTGCGGGGGCCGGCCCGCGCGCAGCGCCGA 

GTACAGCTCCCCCAGCTCGCGAACGAGCAGCGCGAACGACCAGTAGTCGA 

CCACGAGGTGGTGAACGACCACCGTGAGCAGCGGCGGCTGCCCCTCTCCG 

CGCCGCCAGACATGCACCCGGAGCAGCGGTCCGCGCTCCAGGTCGAACGC 

GCGGCGGCGCACCTCGTCCGCGCGGGCGACGATCTCGCGCTCGTCCAGCG 

CCATCGCCGGCTCTTCGGCCCATTCCAGGGCGACATGGCGGTGGACCTGC 

TGCAGCGGATGGCCGTCGCGCGTGAGGAACGTCGTGCGGAGCGCCTCGTG 

CCGCTCGACGAGGCCCTCGAACGCGCGGCGCAGCGCGGCCACGTCGACGC 

CGGCACCGAGCCGGACCGTCCTGCCCAGGTTGTAGAGCGCGCCGTCGGCC 

GACTTCTGGCACTCCAGCCACATCGCCCGCTGCCCCTCGGTCAGCGCAAA 

CGGCTCTTCCGGCGTCGCCACCCGCGGCGACGAGGGGCCGTCCGGACGGG 

TGAGCCCTCGCTCCAGGGCCGTCGCTGCGGCGGCGGAGGTCGAGGCGGCG 

CCGGAGGAGAGATGACTGGCGAGCTGCGCGAGGGTTGGGTGCTCGAAGAG 

CGTGCGGAGGGTGGTGCGGATGCCGAGGGAGGACTCGATGCGTCCGAGGA 

CCTGCATGGCGAGCAGGGAGTGGCCGCCGAGGTCGAAGAAGCTGTCGTGT 

CGTCCGACGCGGTCGAGGTGGAGGACGGATTGCCAGATGTGGGCGAGCTC 

GAGCTCGAGCTCGCCCGAGGGGGGCTCGTAGTCGGCGTGCGCGGCGGGGG 

GCGCAGGGAGGAGCTTCTTGTCGACCTTGCCCGAGAGGGACATGGGCAAG 

GCGGGGAGCAGGACGAAGTGGGCGGGCACCAGGGCGTCGGGCACCAGGCG 

GGCCATGCCTTCGCGCAGGTCGCGCTCGGAGGGCGGGTGGGCGTCTGGCA 

CGACATGGGCAATCAGGTGCGCGGCGCTGCCTTGGCCGTGGAGGACGACG 

ATGCCCTCGCGGACGCCGGGCAAGCGTCGCAGGACGGATTCGACCTCGCC 

GAGCTCGACGCGGCGACCGCGGAGCTTGACCTGCTCGTCGCGGCGCCCCG 

CGAAGGCGAGCTGTCCGTCGGGGCC-CCAGCGCACCAGGTCGCCGGTGCGG 

TAGAGGCGTGCGCCGGGCTGGCCGAAGGGATCGGGCAGGAAGCGCTCTGC 

GGTCAGGTCCGTGCGTGTGTAGCCCTGGGCGAGGCACGCTCCGCCGATGT 

ACAGCTCGCCPAGGGCGCCGGGCGGGACGGGCTGCATGTGCGGGTCGAGG 

ACGTAGACGAGGGCGCTGTCGACGGGTCGGCCGAGCGGGGGCTCGGCGCC 

GAGGTCGGCGATCTCGGCGACCGTGGTGATGACGGTGGCCTCGGTGGGCC 

CGTACATGTTGAAGAGGCGGAAAGGGAGC.GGGCGCCGGAGCGGATGGAGC 
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TTGTCGCCGCCGACGGTCATCGCGCGCAGGGCGGAGCCGGTCCAGTCTTG 
CTCGAAGCACGCCTCGGCCAGGGGCGTGGGCATGAATGAGAGTGTGGCCC 
GCTGAGCGACAAGCCATGAGACGAGCGCCGTGGGAGAGCGGAGCGCGTCG 
TCGTCGGCGAGGAGGAGGGCAGCGCCGCAGGCGAGCGGCGTCCAGATCTC 
5 GTAGACGGAGGCGTCGAAGCCGCTGGAGGCAACCTGAGTCCAGCGGTCGC 
i$ TGGG CGAGAGATCGAGTCGGAGGTGGAGGAAGGAGACGAGC CTTGAAAGG 

CTCGCATGGCGCACACAGACGCCCTTGGGGGTGCCGGTGGAGCCGGAGGT 
GAAGAGGACATAGGCCAGGTCGTCGGGCCTGGAGACGAGAGGAATGTGGG 
TGCTGGGCGCGCACGCCCCGTCCTGGACGAGGTGGACGGGGCAGGGGGCG 
20 10 GCGGTGAGCTTGTGGCTGGCCTGGCTGCTGGTGAGCGCGAGCGAGGCGCG 

GCAGTCGGCGAGCATCTCGGCCAGGCGTGCCGGGGGGTTGGCGGGGTCGA 
GCGAGGCATAGGCGGCGCCTGCCTTGAGGACGGCGAGCTGGGCGGCGACC 
ATGCGGGGCGAGCGCTCGATGCAGACGCCGACGACGCTGCCGGGGCCGAC 

25 

GCCGCGGTCGCGCAGCCACAGGGCGAGCTCGGTGGACCAGGTGCTGAGCT 
1 5 GTGCGTAGGTGAAGCGCTGGTGGCCGAACTCGAGCGCGGTGGCGTCCGGC 
TGTCGAGCGGCGTGGGCCTCGAACAGCGCGTGGACGCAGGCGGGGGCCGG 

30 GGCGGAGGCGGCCTGTCGTGCGGCGGCAGCGCCGCTCCAGTCGTCGAGGA 

GCAATGCGCGCTCGGCGTCGGAGAGCATCCGGAGCTCGGAGAGCGGTCGA 
CCGGGGTGCTCGAGGGCGCTTTCGAGCAGGACCACGAAGTGGCGCGCCAT 
20 CCGCTCGATGGTGGCGGGGTCGAAGAGCTGCTGGTCGTACTCGAAGCGCA 

35 GGGCGATGCCGGCGTCGAGCTCTGCGGCGAACAAGGCGAGATCGAACTCG 

GCCGCTGCCTGCTCGTCGGCGAGCGTGGTGAGCTCGAGCTCTCCCTGCGC 
GATCCGCACGTCCCCCACGCCGATCGCGAGGGCTGACAGGCGTGCATCCA 
GCGATGGCGGGGTGCTCTGGAAGGCGAAGGCGACGTCGAACAGCGCGTCT 

40 

25 CGCTGCGCCTCGCCCTGCGCTCGCGCGAGCAGGTGCCGGAGGGCGCTGTC 
GCCGTGGGCCAGCGCGTCGAGGAACGCATCTCGCACGCGGGCGACGAGCG 
CGTCGAAGGACGCGGCCCCGCGCAGCGCCACGCGCACGGGGAGCATCTGG 
45 ACGAAGTAGCCAAAGGCCCTGGCGCTCTCGTCGTCGTGCCGCCCCGCCGA 

GGGGACGCCCACGACAAGGTCGCTCTGTCCGCTGGCGCGATGGAGCAAGA 
3 0 CGGTGAGCGCCGACAGCAGGACCGAGAAGAGCGTGGTCCCGCGCTCGCGC 
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GCGAGGCGCGCCAGCGCTCCGGTCAGGGGCTTTGGCAGCGTGATCGCGTG 

AGCGCGACCGCGGCGAGCGCCCGCGTCGTGGCGAGCCCGGTCGCGGGGGA 

GGTCGATGGCGGTCGTCGCGCCGTCGAGCGCCTTGCGCCAGTATTCTGCT 

CCGCCGGCCGCCTCCCGCGGCGAGGGACAGCTCACGCCGGCGGCGAAGAA 

GCTCGACGGCGGCGGCAGCTGCGGGGGCCGGCCCGCGCGCAGCGCCGAGT 

ACAGCTCCCCCAGCTCGCGAACGATCAGTGCGAACGACCAGTATTCGACC 

ACTACGTGGTGATCCACCACCGTCAGCACTCTGCGTTTAGTCCTTTCTCC 

TGTTCGGCCTATTAATTGCTACTATGGATCCACACTGCTCCGCCTCTTGT 

ATCTCCCTTATCTGCACTTGCTGCGCTTNACCTTTACGTTCCTCTCCCTG 

CTCTATACTATTTCTTCCCCCGCTTCTCGTCCTATTCTGCATTTGTCATA 

TCGTATCTTCATATACCTTTCTTTCGCTATCCTTACTGCTTCTCGACCTT 

ATGTGCGTCTGTCTTCCCTTTCTNTATTATTTCTCTGTCTCACCGCTCTN 

TGCTCTGTCGCTCCTATCACTAAATTATGTCTCTATCACTGCTACTATCT 

GAAGCTGATCTTCGAGATCTCGCTNGGTGTCACTCTTTATCTCATAGNCG 

CCTCTGTCTTCTTGTCTCCTTAAGNCTGATTTTCTCGCTCTATTCGTGAC 

TACTCTGCTGTCTCTCACATACGTGTTCTTGAATCGTATTCGCGTTCTCG 

CTACTGTGATATCCATTGCCGACCTCTACTGCTCNTCTNTATGCTATACT 

TCTTAGTCTCTTACTACGTTNGTCTGATATNTTGCTGACGACGTCATGTC 

ACGCTCGCAACTCTTCANTTCTATCGTATACGCTGATCATCATTTTCTGT 

GAGGCTGATGTACTATACGTAATTACCTGTATACGTCGTCTATCTACTCT 

CGTGTCTTCACTCTTTCTACTCC 

Sec ID No 91 (>Contigl2) 

CCCCCCGCCGTCCGCCGGTACGTCGCGGACCGCCGCCCCGAGCAGCTCCC 
CGCGCTCGCGCCGGAGGAGCGGGAGGCCGCGGCGCGCCGCCTGTCGGCCC 
TCGGCGCGGCGCCGCCGCAGGTCCGGCGCCGCGGGCTGACGCGGGCGCCG 
CTCTCGTACGGGCAGAGCCGCATCTACTTCCTCGAGCAGCTCTCGCCCGG 
CAAGCCGCTCTTCAACGTCCCGGGCGCGGTCCGGCTCCGGGGCCCGGTCG 

ACGTCGCCCGCCTCTCGGCAGCGTTCGGCGAGATCGTGCGGCGCCACGAC 
GCCCTCCGCACGTCGATCGCCAACGTCGACGGCGAGCTCCTGCAGATCGC 
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GCAGCCGCACGCGGGCTTCGCGCTCGACGTGGTGACCTCGACGCCCGAGG 
AGGCGGCCGAGCTCGACCGGCGGCTGCGCGCCGAGGCGTGGCGGCCCTTC 
GCGATCGGCGCGCCGCCGCTCCTGCGCGCCACGCTGTTCCGCCTCGCGGA 
GGACGAGCACGTGCTCCTCGTCACGATGCACCACGTGGTGTCGGACGACT 
5 GGTCGCTCGGCGTGATCCTCCGCGAGCTCCTCGCGCTGTACGCGGGCCGC 
TCGCTCCCGCCGCCGCGGCTCCAGGTCAGCGACTTCGCGGCGTGGCAGCG 
CGAGATGGTCGAGTCGGGGGCGCTCGACGGCCAGCGCGCGTACTGGCGAG 
AGCGCCTCCGGGGGCTGTCCCGGGCGAGCATCTCGGCCGGCGGCGGGGCG 
GAGGCGCCGAGCCACGACCCGTCCGGCGCCATCGAGGAGATCGCGCTCTC 
1 0 GCCGGACAAGGCGGCGGCGCTCGAGGCGCTCGCGCGGCGGGAGGGAGCGA 
' CCCTGTTCATGGTGCTCCTCGCGCTCCTCGACCTCGTGATCCATGCGCGG 
TCCGGCGCACTGGACATCGCCGTGGGGACGCCCATCGCCAACCGGAACCG 
CCCGGAGCTCGAGGACGTGGTCGGCCTCTTGACGAACACGCTCGTGATCC 
GCGTCGATCTCGCGCGCGCCGGGGCGTTCCGCGACGTGCTCGCGCGGGCG 
1 5 CGCGTCCAGGCGCTCGACGCCTTCGCGAACCAGGACATCCCGTTCGATGT 
CGTCACCCAGGATCTGAAGCAGGAGCGCGACCACGCGCAGCACCCGCTCT 
TCCGCGTCTGGCTGGCGCTCCAGAACGCGCCGAAGCCCGCGCTGGAGGTC 
CGCGGGCTCCGGGTCGAGCCCCTGCCCCTCCGGCCCGAGCTCGTGCACTT 
CGAGGTCGCCCTCCTGCTCTGGCCGGCGGACGACGGATCGGTCGTGGGGC 
2 0 ACTTCGAGTTCCGGCGCGATCGCGTCGACGAGGGCGCGCGCAAGGAGATC 
GCGGCCGCATTCACGCACCTCGTCGACGCGGTGATCGCCCGGCCGGACGC 
GCCGGTGTCGACGCTCGTGGAGGGCGCCCGCGCCGAGGCCGCGCGAGCGC 
AGGCCGCGCTCGGCGAGGCGTTCGCCAGGGCGGCGACGGCGCGCCTCGGC 
CAGCTGCGGCGTCGCTCGGCGGGCGACCGGACGCCCCGCGAGTAGCGGTC 
2 5 AGCCCTCGGCGGCGGCCAGGCGCACGCGGAACGGCGCAGGGTAGCCGTGG 
. ACGCGCGGCATGGGGTCGATCGCGCTGGGGACGCCGGCCCGCAGCAGCTG 
CTTGATGGCGAGCGAGATGTGCAGGATGGCCACGTACTTGCCGTGGCACG 
TATGGATCCCTGACTCCCAGAACAGGTAGTTTCTGCTCGGCAGGCGCCCG 
GGCCTGAACTGGTCCGGGGCGTCGATGTGCTCGTGGTCGTGCATCGCCGA 
30 GGCGCTGCAGGCCATCACCAGCGCGCCGGCGGGGACCTTCTCCTCGTGCC 
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GCGTGCCACGCCCGACCGTGTAGTCGCGCACGCAGAGGCTCGTGACGCCG 
GTCGACGGGGGACGGAAGCGCAGCGCCTCCAGCACATAGCCGGTGATGGC 
GGCGTCGTCCTCGACGTTCACCACGTTGAGCGCGTCGCGCAGGACGCGCG 
GGCGCTTCATCAGCTCGACCAGGGCGTTGACGATCGCGCCGCCGCTGAGA 
5 TCCACGCAGCCCATGAGCAGCCCCAGGATCACGTCGCGGATCCCCTCGTC 
15 GCTCTCGTAGGTCTCGGGGACCGACTGCATGACCAGGTAGCGGTCCAGCA 

CCGAGGGTTGCTCTGGCGGGGGCGACTTGGCCAGCTGCTTCTTCCGCGCG 
GCGACGATCGCGTCGATCATCGGCAGCGCCTCCTGACGAGCGGCCCTCGC 
CGCCGCCACGGCCGTCGGGTCGTTGGTCGGGTTGAGGAAGATCTCGTTGA 
20 io ACAGCGCGTGGGTCCACGCCACCACCTTCTCGGTCGGGATCTCGCCGACG 

CCGAGGTACCGGGCCATCGCGCCGGCCGGCACCCTGAGCGCGTAGTCACC 
GGTGAGATCGAACGGCTTGTCGACGCCGACCTTGGCGAGCAGCCGGTTCG 
CCTCGTCCACGACGATCTGACGGTAGCGGGGCAGATCGGCGCGCGGGAAC 
GCGAGGCGCAGGAGCGACTTCTCGTGCTCGTACTTGGGCGAGTCGTTCAT 
CGCCAGGATGTTCTGGCCCACGTTCTCGACCAGCTTGGGCGCGATGTTGT 
CGACCGAGAAGACGTCGTTGGCGTTGAGGACCTCGACGACGTCGTTGTAC 
CGGGTCACGAGCGTGATGGCCGGGATGGAGAAGATGGGCTTCTCGCGCCG 
CAGCTGGCTGAGGAACGGGAGCGGCTCCTCCCTCAGCCACTTGAACACCA 
TGCCGGCCTCGATCTGCTTCCGCTTGACCGGATCGTTCTCGTGCGCCAGC 
2 0 GCGCTGTGGAGGGCCTGCAGQTAATCGAACGGCGGCGCCTTGGCAGCGTC 
CGCTCGTCCCTCTTCTTCGATGTGAATGCTCATGGGGAGAATTCCTTTCT 
CGCATGCCGATCAGATCGCGACGCTCTGGGGGACCATCGACGGGAGCAGG 
TACAGGTACGGCTGCTCGCGGGCGCGGTTGCGCTCGGTGATCTCCCGCTC 
GATGTGCGCCAGGCGCTCGCGGTAGCGAGCAAACGCCTGCTTCGCCGCCG 

40 

2 S GATCTTGCAACAGGCACTTCATCGTCATCAGGCTCTCGCTGGTCAGCGGC 

GGGCCCATGCTGAGCGAGCGGCTGAGCACCATCTGGCGGATGCTCTGCGC 
GCGCCCCGGGAGGCGCTCCAGCGGCTTGAACTGCCGCTTCTCGCTTCCGT 
45 TCAGGACATCGCCGTAGGAGCGGTAGGTGGCAAACTGAGCGTTCGGGATC 

CAGGTGTAATAGTCCGTCTGCCCGAAGTTCACGGCGGCGTGATATGCGGT 

3 0 CGCCGTGAAGATGATGTTGGTGACGATCGCGATCAGGTCGTCGAGGCTCG 
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TGAGCTTCTCGAGCTGATCGGCTCGCTCCGGCGGGAGGAGGCTATCCATG 
CCGCCGAGCTGGGGGGACACGAGCTCGTGGATCCACCGCTGCAGGCTGGC 
GTCGCTCGACAGAGACCCCGGCGTCGGGTAGGCGATCTTCAGCACCTGTC 
CGACGTACTCCTGGATCGCGTCCCAGTGCAGCAGCGCGTCGTCGCGGTAG 
5 TGATAGCCGACCAGGTCGCGGACGTCGCGCGCCGACAGGTCGCGGGGGAG 
CGCGCTCTCGTAGAACCGCCACGGCTTGCCGCCGTACCCTTTGATGCCCT 
TGCCGGTGTAGGCGCGCGTCAAGAGCTCGAACGAGCCCATGGTGGCCACC 
GAGCTCGTGATGTCGAAGAAGCGCCCTCGCCCGAGGAAGCGCCGGCGAGC 
CAGCTCGTTGATGGCCAGGGTGTTGAAGAAATGCGGCCTGAGCAGCTGGT 
1 0 GGAGCGGATGCGTCGCGGGCAGGTTGCGGTAGGTGCTCACCGCGAACGGC 
TCCACGATCAGGTGCGCGTACAGCAGGTGGGTCACCTGGCCCTGGTAGAT 
GGCGTCGGCGCTCGCGACGGCGATCTTCGCCGTGAGCCAGTCGTCCGACG 
GACCCGAAGGGGTGAAGATCTTGTCGGGATGCGCCCCTTTCCCGGGGCGC 
GAGTGCACCAGCCTGATGGCCACGGGCAAGAGCTCACCGGCCGCGGTCTG 
1 5 GTGCAGCATGCACGTCGGCGCCAGCGGGTACTTGCCCAGCTCTTCCTGCA 
CGTCGGTGTCGACGATGTCCTTGAAGATGCGGTAGTCGAGGAAGTAGAGC 
TGCCCGCCCTCGCGCACCTCCTCCAGCGTGCGACCGTCGGCGATCGCGAT 
CGGCTTGGGCTCGGCGCCGCTCACGAAATCGGCGAGATCGGCCGGGGTCG 
CGCGGCGGATGTGCGCCGGGTTGATCCCCACGAGGCGCTGCCGCCCGAAC 
2 0 TCGGCGTCCTCGGCCCAGCGCGTCGCCACGAGGGGCTTGCGGATGAAGGT 
CCACGGCTTGAAGAACTCCTCGAACTGATCGAAGCTCTCCCAGTTGTCGA 
TGGACTCGAAGATGGCGCCCAGCCCGAGGTCGGACGTGGCCCTGAGGACG 
AACTTCCCCTCGCGATAGCGCTTGTACCCGTACTCGAACAGGTGAAGCGC 
CTGCGCGATTTGCAGTCCGGCAGTGTCCTTCCACTTGCCGAGGTTGAGCG 
2 5 CCAAGATCTTCTTGATCGGAAAGCCTTCCCCCGGCGGTACGGAATCGCTG 
CCCGCCGGCAGGTTGGACCCGAAATTGCGCCAGTTCGGTGTCGAGCTGGG 
CTCCCTCATGCTCGTGCTCGCTTCTCCGTCTCAGACGGACGGTGGATTGG 
GTGGTTCACGTCAAACATCGCTCTCGCGTCGCAGCGGTCCGAGCGCGCGC 
CGGAATGGTTCCGTCTCAGTCGCAACAGGACTCAGTACATCCAGCGCCGC 
30 CCCCCGTCCTCGACCTGCCCCCGCAGCCGATCGCGCCGCCCTTCATCGTG 
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GAATCGACAGGTGCGATTCCACGAAAAGCCGCCGCGCCGAGTTGCACGCG 
ACCGATGCTCACGCGTGCATTGTTGAGGCTGCTAGAAAACCGTGGAGCGT 
TCACGCATGTCAAGCCATTTTGTTCGGCGCCGCGGCGAGCGGCCGGATGC 
CGCGCGCCCCCGCGCCGGGCGTGTTCGCTCCCGACGTACCGCTACCTCGA 
CGACGTATGGCTTGAAGGGCAACCGCGCAAGTCGTCCGATTCGTGCTCGT 
ATCCTGCTCCTTCCAGCAGGATTTCCCCGCCGCCAGCGGCACAAAGGTGC 
CAGGGCGAGCAGAAAGAGCGCTGCCCGCCCCTCCCCGGCCGCGCTCGCCT 
CGATCAGCGCGCGCTCGTCGTCGGTGCCTGAGACGTTCGACGGACGTCAG 
TTAGTTAGCCTAGCTAACTTCAACACTGATGCGACTGATCGGGCCGACGC 
20 10 AACCGACGCAACCGACGCAACCGACGCAACCGACGCGACCGACGCAATCG 

ACGCAACCGACGTGACGGACGCTGGCGACTCGAAGAAAACCACGGACGCA 
CTCCACGTCATCGACGTCATCGACGTCATCGATGCGCTCGATGCAATCCA 
TGCACTTGACGCGATCGGTGCGAGCAGGCGACGAGGTCCTCTCGTGAAAC 
ACCGAACCGAGTGCCGGTAGCGGGCGCGCCGCAGTGTATGCTAGGCTCGG 
CCCTCTTGTCGAGGCCGCGCGCTCGGCGGTCGAGCGTGGGCTCGGGTGCC 
GCGGTATCCGGCTGAACCAAGGAGGAGCGAGCCATGCAGGCAGATGACGA 
CGCGACGATCTACAAGGTGGTGGTGAACCACGAGGAGCAATACTCCATCT 
GGCCGGCGGACCGAGAGAACCCGCTCGGCTGGACGGAGGCCGGCAAGACG 
GGCAACAAGGCGGAGTGTCTGGCGTACATCCAGGAGGTCTGGACGGACAT 
2 0 GCGCCCGCTCAGCCTCCGGAAGAAGATGGCCGAGAGCCCCTGAATCGCGG 
35 CCCGCCCGAGCGCCCGTCGCGAGCGGCCGGGCGGCGGGCTCAGCCGTGTC 

ATCGTCGCGCTCGACCGGCCGCGTCCCGCGGGATCGCGCGAGCCCGGCGG 
GGTCGTGCGCGCCGGCGCTTGTGCCGGGGCCCCCGCTCTCGTACGCCTCC 
GTCATGCCGCCCCTCGATCTGCACGTCGCCTTGTTCGGCGCCTCCGGCGC 

40 

2 5 CGGCAAGACGGTCCTCCTGGCAGCCTTCTACCGGGCGCAGACCCAGCCCT 

CGTTCCAGCAGGAGTACGCGTACAAGATGCAGGCGGTCAACAAGGCGCAG 
GGCAACCAGCTCCTCGGCCGGTTCTATCGCCTCGAAGAGGGCAGATTCCC 
45 GGACGGCAGCACGCGCTTCGACGAGTACGAGTTCGACTTCTTCCCGAGAG 

ATCTGCCCGAGCCGGCGGTCCGCATCCACTGGTACGACTACCCGGGACGC 

3 0 TGGTGGGAGGACGAGCCGGTCGACGCGGACGAGCGGGAGGCGATGCGCCA 
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GGGCCTCATCCGGCTCGGGATGAGCCAGGTGGGCATCCTCCTCGCGGACG 
GCGCGAAGTACCGGGCCGAGGGCACCGGGTACATCCGGTGGCTGTTCGAG 
CACTTCGCCGACGAGTGCGACCGGCTGCGCCGGGCCAGCGCCGCCACGGG 
CGACGAGGTGAGCTTCCCGCGGGAGTGGATCCTCGCCCTCAGCAAGGCCG 
5 ATCTCTGCCCGCCGGACTACAGCGCGCGGGACTTCGAGCGCGAGGTCTGC 
CGGGACGCCGACGATCAGCTGGCGAAGCTCTGCTCGGTGCTCCGCGCCGA 
GCACGCGTTCGGCCACCGCTTCATGCTGCTCTCGTCGGTCGCCGCCCCGG 
CCGGCGCGCAGGTCGATCCGAGGACCTCGCTCGGCGTGCGCACCCTCGCC 
CCCGCGATCCTGGTGAGCACGGTCGAGGGCGCGGTGCGCGAGGCGCAGGC 
20 10 GGCGAGAAAGGAGAAGTCGGCCGGAGAGACGTTCTTCCAGGGGCTGCGCG 

ATCTCGTGCAGTTCGTCGACTCCCTCGACGACTTCCTGCCGAAGCGATAC 
CAGATCGTGAGCAAGATCCTGCGGTTCATCTCGATCAAGGACTTCGCGAC 
CACCCGGCTCGACCGGCTCAAGAAGATGCGCGAGGACGCGATCCGGAAGG 
GCGACACCTTCACGGCGGTCCTGACCGCGATGGTCGCGGCCCTGCGCGAC 
1 5 GACGAGGGCGCCCGCGCCTACCACCAGAACCAGTGAGGTCGTCATGCCCG 
CGCCAGCGCCCCTCGTCGAGACATCGCGCCTCCTCTGGAGGACGCGCGGC 
GAGCACTGGGATTACGAGTTCATCTGTGTCCCCGAGATCCCGGCGCTGCC 
CGCCTGGCTCTCGACGCTCGAGGCGATGCTCGCCGACGCCGACGCCGGCG 
CCGGGGAGCTCCGCTATGGCCTGCTCGAGATCGACGATCGCGGGCAGAGG 
2 0 GCGCCGCGCGCCTATCCCTACGTGGCCGTGAGGTTCCTCGATCCGGCGCG 
35 GAGGGACTGGACCGGACGGCAGGTCCAGCACTTCGCGGCCTGGTTCCCGC 

CGGTCCCGCCCGAGGCGGTCGCGGAGTTGCCAGAAGCGGTCCCCGCCGAC 
TGGCACCTTCGCGTGCTCGACGGGCTCGCGGGGACGTACGGCTCCGGCGA 
GGTGTTCGGGCTCCCCGAGGCGACGATCCGCGCCTGGAAGCGGAGCCACG 

40 

2 5 ACGAGAGCCGGGCCGCGCGCGCGATGGCGATCGTCAAGGCGACGCCGCCG 

GTTTCGCTGGGCGGCGGCGAGGCG3CGCCGTCGCGGTGGACGCGGGTGCC 
GACATTAAAAAAAAAGCCGCCGGAGCCGCCGGCCGCGGCGGGCCTCCTCT 
45 CGGTGGGCGCGGTCCCTAGCGGCCAGGGCCGGCGATTCGGCTGCTTCGCG 

ATCGGCGCCATGATGCTCGCCGCCTTCTGTCGACTGATGCTCGCTTGCGG 

3 C TGTGCGCCTCCTCGGCGCCTGACGGCTGCGCCGCGCAGGCCATCCGACGG 
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GGGGTCGGCCCGGCCAGCGCCCGCCGGGCGACACCAGGGCATCGGCCCTC 
CGCTCGGGGCATCGATTGAGCTCTCCGAGCGGCGGTCCGTCGTCAATCGC 
CGCAGAGCTCCCACCGGGCGGAGCAGCTCTGGCCGGTGACCGCATAGGGG 
TTCGTCGGGCAGGTCCACCACTCGCCCTGGAAAGGACGCGGGTTGCAGTG 
5 CGGGAGGCACTCCACCCACCCCGACGAGCACGAGTTCCCTACCGAGACGG 
15 TCGGCTGAGCCGCGCAGAACCAGCGTTTTCCTGCGAACCAGCCGGGATTG 

CACACATCGGGCGCCCCGCCGACGGGCGGATACACCGTGGCGACCGCCTG 
GATGTCCACGGCGTCGAGGGTCTCGCTCCCGAAGCGCACGCCGTCCTGAT 
CCGAACACCCACTCCAGTAGGTCATGATGGAGTCGTAGTCGTAGAAACCA 
20 10 GGGTTCACGACGATGTACCGCCGGCTCGAGGGCCACCCGCTGGCGACGTC 

' GCTCGCGGGGAGCGGCTCTCGTTGGCTGCAGGCGCTCGGGACCAACGGGT 
GATGCCACTCATGCATGAAGCCGATCGCATGACCCATCTCGTGGATCGCG 
TACTGCTCCACGCAGTCGAAGCTGTATTCGACCCGGGCTGTCTGCCAGTT 

25 

GTACTTGATGCAACGGTTGAAGTCGGCGCCCCAGGGCTTGAACTGGACCG 
1 5 AGCCGCCCTTGTTGTAAACGCCGATCGAGTCCGATTGGTTGGGCGCGTCG 
GGGTGGATCCTGACGCCGACGTAGGTCATGCGAGTGGCCGGCAGGAGCGA 

30 ATCGCAGCTCTCCCAGCCGGTGAAGCGAACCGAGCTCCAGCGTTCCCAGC 

TGCCCTGGAGCGCGGTGCGCACGCGCGTGATGACGTCCGCGAGCGAGGGG 
TTGGGCGCATGGATCAGCCCGCCCGCGGCGCCGTCGACCCTCTGCTCCGC 
2 0 CGAGCTCGTGGGGTCGATGCAGACCGGGATCCGGACATGGCCGTCAGCGT 

35 CCTCAGGCCAGCGACTCGCGCTGTCGAAGACGCTCGCCTCGGCGGACCGC 

GGCGCGGCGGAGACGGTCAGCGCGGCGCCCAGCGCTGCGAGGAGCAGCGG 
ACCGAGCGAAGAGCGAAACCGCACATGTCGTTCAGGGCCCCGCGTCGTGC 
GGTGCACCGAGACAATCTCGAGCGGGCTCATGGACGCAAACGCGTTGCGA 

40 

2 5 TGGCCTTGCAGCATGTTCTTCTCCAATCGACGAGGGTTGTTCTGCTGAAC 
GCGGCTCCAGCGTGGAGCTCGACGCGGTTCACCGGCTTCACGCCGGGGCC 
GTGGACGAGACCCGAGCACGGGGGAGGTCGCAGCCGCACCGGCTCGCGGC 
45 GCCTCCACCCTGCACCTACGACGAGCCTGCCGCTCGGTTTCGCGGAAAAT 

GCCACCCCGCTGCCCAGCGGGCGAAGCGCGGACGAGGCGCTCGTCCCCAC 
30 G3TAGCGCCGGTGCCGCTGCATCCACCGCGCTCCTCCATGGGTCGCTGCC 
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CGCGGGTCGTCGAGGAGACGGACCCGGGGCGCGGATCCCTGGCTCGGCGT 
CGCATAGCTCGTAGGGGCGGCCTTGAGCCGGCGGTACGAGCGGCGCAGTT 

10 

CAGCAGCCGACCACGTGGACGCGGCGCGCTCCGAGCTGCGCCGAGGAACC 
CTTCAAATATTCAGATGGAATTCACAGGGTGGCTGAGAGACGGGGAGTAA 
5 GATCTCAGAGATCTCCCTGCCTACCCGCATCCCTGTTCAATTTTCCGCCC 

15 ACAACGCGAACGGATGAGGAAATATCAGCCCGCGATCCCGACGGCCGACA 

GCATCAAAGGCCGCTCGAATCCAGGGGATTCGAGCGGCCTCGGTCGCGCG 
GACCCCCGCCGCGAGCCGCTTTGTCACCACTTCACCACTTCAGAGCTTCG 
ATCATCTTCTCACCATAACGCGTGCCCATGATAACAACGGACGCATGATC 

20 1 0 GAAGTGGTACTGATCCATGACGTTGGTCCCCTGCTGGCTGACCCAGTACC 

CCATAGGCAGCATGTCGGCCGCTTGGTGCACGAGGTTGTTATGACCGCCG 
CAGCACCCGCCCGCGGGGAGCTCTCCGAGAATGAAGGGAACGTCGTAGTC 
GACCCCCCAGGCTGCTTTCACCTCGTTATAGAGCTGAACGACCTTGCCGG 

25 

GCCACGAGCTCTGGCCGTTGTCGGACTCACCCTGGTGGAAGATGATGCCC 
- 5 GCGAAGCGCGCGTTCTCGGCCGTCTTCGCTTTGGCGATCTTGTTCAAGAT 
CATCTGGTGATGCGAGCCACCAGTGATGAACGTGTTGATCGACTCGCCGC 

30 TCTCAGCGGTAGCGACCAACCCGATCGTATCCCCCTCAGGCAGCTTTCCG 

AGCAGGGTCTTGCCGAACCAGATGCCCGGGTCGACGGAGGTCGACAGGTT 
CCATCCTTTTTCACCAGGGCAATCGCTGAGCGGCGGATTGGCCAAGTTCC 
20 ACTGTCCGGCCGGCTGATTGCATCCGCCGAGGACCTTGAGCCGCGCGTCA 

35 GAATTTTTGTCGCTGTCCTGTTTGTCTGCC3ACACCAGCCATATTCGACTG 

GCCCATGAGCATGAAGATGTGAAACGTCGGACTCGCGCTCGGTGCGCCGC 
CGGTGCCTGCCCCGCTGCTAACGGATCCGGTCCCTCCCGTGGCGTCACCT 
CCAGTTCCAGCGTTCGTGCTGCCTGTCGCGTCGCCCCCGGTCCCCGCGCT 

40 

25 CGTGCTGCCTGCCGTGGCGCCGCCGGTCCCCGCGCTCGTGCTGCCTGCCG 
TGGCGCCGCCGGTCCCCGAGCCGGCCCCTCCGGTGTTGTCGTCCTCACCG 
GTCGCGCCGGACTCGCCACAACCGGACGCAGCGATGATGAAGAGGAATGG 
45 GAGGAGCAGGAACCTGGGTGTGCCTCGGGTCGTGCGGTTCATCTCGGTCA 

TGATCGTTACCTCGTCGCGCCGGGGCGCGATCTGAAGAGCATGGCGGAAT 
3 0 CGGTAGGCCGGCGTCGCGATGCCGGCGCGGCGAACCTCGCCCGCAAAGAG 

50 



55 



10 



WO 00/22139 PCT/US99/23535 

212 



CTCAGCGCCGGGGCCTACCTTATCGCATCTTGGGCGCTTGGCGTCCAGGA 
TTCGGCCTTAGACAGCACAAGCAGAAGACCTTTGACACTGGATTTTTTCA 
TCATCGGCGGCGCTCGTTCTTCGCTGCGCCTCAAGCGCCGACCGTTCGTT 
TCGAAGCGAAGCGGTTTCACGATCCGGGATCGCGGAAATTTGAAACGGAC 
5 GCGTCGCGCGGGCAACGCAGGGGACTCATCACGAGGCAACCGCGCTGCGT 
15 CGCGAAATTGGCCAGCCTCTCGGAGTCCCTAGTTCCGTGCGTCAGACGCG 

TCACCCACCATGTCGAGCTCGGCGCGGCGCTCTACGTGCTTGAAAGACCT 
CCGCGAGCGCCGCGCTCTGCGCTCCGCGAGCGCACCAGGCTCCCCGTGGA 
TTCAGGGCAAGGCGGTCGTGATCACGTCCTTCGTGGTCGCCGTGCTCTGG 
20 10 CCGGGGCGGTCGACGCAGGACGGGTAGTACTGGCCGGCCCCGTTGAGCTT 

CTGGCAGACCCGGTCGCACGACCCGGCGATGCGGATCAGCCCGCACTCCA 
CGATCTGACCCCCGGAGGTCACGTGGCCCGCGGCGCAGTCGCGCTGGTAG 
GCGCGCGAGTTGTCGACCGTCGCGCTGTTGTAGCAAGCGTTGATATAGGG 

25 

TTGCGCGGCGAACAGGTTGCCCCAGAAGGCCCCCTCGACGTCCGGATAAT 
1 5 CGATGAGCTCCTGGCTGGAGGAGAGCGTCTTCAGCGGATCCCGCAGCGAG 
CGGGCGGAGAGGAGCACCGGTACTTGATAGTAGTTCACGCGCGCCGCCAC 
30 GCAGCTGGACACGATGCGCTGCCCTGCGTCGTCGAGCGGCCCGCTCGCCC 

ACGCGGGCGCGACGCCGAGCAGCCCGGGGTAGCGCTCGTCGTGCCTCTTG 
CCGTTCGAGTCCGTCCACGAAAAATCGAAGGAGGCCGTGCTGCTCAGGGC 
2 0 GCAGCTCGCCGCGTAACGCAAGAAATCGCGCGCCAGCGCGCCGCTCGGCC 
35 CGGGATCCTGGATCGCGGCGAGGTTCCGCGCGCTGAGGCCGCTCAGGTTC 

AGGGCGTTCAGGTTCAGGGCGTTGAGGTTCAAGGCGTTGAGGTTCAGGGC 
GTTCGTGCTGAGCGCGTTGCCGCCCACGAGGGCCCCCTGGGATTCCCCCA 
CAGGCTCGCCCCACGCATCGGCGTCCACCACCTCGGCGGCGCAGCCCGAC 

40 

2 5 AGCACCCCTGCCCAACCAAGCACGATGAATGTCCGCTCGAGAGACATGGA 
TTCCCCCGTGTTCCTGGCGCATGACCCGACGGCGCCCTGCGCGCGGCGCG 
CGCGGGCTCCCATCGATTCGCTGGATGGGTTCAATATTCTACTTTTTCCC 
45 GCGCTCTCGCGCCGGTGAAAGTCGCTTCAGCGGCGGCGAGGTCGATGTCA 

GGAGCGTCCGACTCCGTCGCTCTCGTCAGCTCCGCGTACCAGCGACGGAG 
30 TCGCCCGCCCATGACGGTCGGAATGGTAGAGGCGGCCGCGAGGGCGCGCT 
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CGAGCTGCGCCCGGGCGTCGGCGCGCCGGCCGCGGCGCAGGGCGGCGAGC 

GCCCGCGCCTCGAGCACCTCGATCCGCTCCTGCCCGACCGAGCAGCGCGC 

GGAGCGCTCCTCGAGCGCGGCCCACGCGGCGCGGTCGTCGTCGCGGGTCG 

CGAGCTCGATCATCGCGCAGAGCACGTCCTCCGAGGGCTTCAGCGCCTCG 

CAGCCGGCGTCGTCTCGCGCCGCGCGGAGCCGCAGCGCGATCCGGCGAGC 

GCCCGCCTCGTCGCCCTGGTAGAGGCGCAGGCGCGCGATCAGGAGCGTTA 

CGACGACCGGCGCGTGGCGATCGCCGCAGCGCGGCGCTATCGCCTGGACG 

GCGCGCGCATGGGGCCTCGCGGCCGCGAGATCGTCCATCAGGTACAGGTA 

CTCGGCGAGGTTGTAGCGGCCGACGAGCTCGAACGCGGGCTGGCCGAGCT 

CGCGCCCGAGCGCGATGGTGCGCTCGAAATCGGCGATCATCCCGGCGCGA 

TCGCCCTGGAGCGCCCGCGCGAGCCCGCGGTTGTTGAGCGCGGCGCCGAG 

GTGCATGAGATCGCTGCGCTCCTCGCAGCTGAGGATCACCGCGTCGAGGT 

CTCGCGCCGCCTCCTCGACGCGGCCGAGGCTGGCCAGGATGAAGCCGAGC 

AGCAGCAGGGCGATGATGTGCGTCTCGTGGCCCTCGTCCCCGAGCCGCGC 

CGCCTGCGCCGCGGCGCGCGTCAGCACCGCGGCGGCCTCGTCCTCGCGGT 

CGGCGCGGTGGAGCGAGCGGCCCACGCCGAGGAGCAGGCGGGCGCCGAGC 

AGGGGCGAGGCCACCCGGCCGGCGAGGCGCTCGGCGGCCGCGACCCGCTC 

GCGCGCGGCCCGGTACTCGCCCGTCCAGTCGAGGATCATGGCCTCGTCGA 

GGAGGAGCTCGATCTCGGCCCCCGCCTCCGACGCCGCCGCCGCCGCCTCG 

CGCGCCGCGGCGAGGTCGGCGAGGGCCTCGGTGTGGCGCCCGAGCCGGAA 

GCGAGCGAGGCCCCGCGCTCGGCGCTCCTCGGGGAGCAGCGCGCCGAGCA 

GCGCCTCGACGCGCCCGTAGCAGCCCTCGGCGTCGAGGTAGGCCCGGCGC 

GCGGCCGCGAGCTCGGCGCCGCGGGCGAGGAGCGACGCCGCGCGGGCGGT 

CAGGCCGCCGCGCTCGCAGTGCGCCGCGAGCACCAGCGGATCGGCCTCGC 

CCGCGGCCTCGAGCCAGTCGGCGGCGAGGCGGTGGCCGAGCGCGCGATCG 

GCAGCTCGCCGCGTAACGCAAGAAATCGCGCGCCAGCGCGCCGCTCGGCC 

CGGGATCCTGGATCGCGGCGAGGTTCCGCGCGCTGAGGCCGCTCAGGTTC 

AGGGCGTTCAGGTTCAGGGCGTTGAGGTTCAAGGCGTTGAGGTTCAGGGC 

GTTCGTGCTGAGCGCGTTGCCGCCCACGAGGGCCCCCTGGGATTCCCCCA 

CAGGCTCGCCCCACGCATCGGCGTCCACCACCTCGGCGGCGCAGCCCGAC 
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AGCACCCCTGCCCAACCAAGCACGATGAATGTCCGCTCGAGAGACATGGA 

TTCCCCCGTGTTCCTGGCGCATGACCCGACGGCGCCCTGCGCGCGGCGCG 

CGCGGGCTCCCATCGATTCGCTGGATGGGTTCAATATTCTACTTTTTCCC 

GCGCTCTCGCGCCGGTGAAAGTCGCTTCAGCGGCGGCGAGGTCGATGTCA 

GGAGCGTCCGACTCCGTCGCTCTCGTCAGCTCCGCGTACCAGCGACGGAG 

TCGCCCGCCCATGACGGTCGGAATGGTAGAGGCGGCCGCGAGGGCGCGCT 

CGAGCTGCGCCCGGGCGTCGGCGCGCCGGCCGCGGCGCAGGGCGGCGAGC 

GCCCGCGCCTCGAGCACCTCGATCCGCTCCTGCCCGACCGAGCAGCGCGC 

GGAGCGCTCCTCGAGCGCGGCCCACGCGGCGCGGTCGTCGTCGCGGGTCG 

CGAGCTCGATCATCGCGCAGAGCACGTCCTCCGAGGGCTTCAGCGCCTCG 

CAGCCGGCGTCGTCTCGCGCCGCGCGGAGCCGCAGCGCGATCCGGCGAGC 

GCCCGCCTCGTCGCCCTGGTAGAGGCGCAGGCGCGCGATCAGGAGCGTTA . 

CGACGACCGGCGCGTGGCGATCGCCGCAGCGCGGCGCTATCGCCTGGACG 

GCGCGCGCATGGGGCCTCGCGGCCGCGAGATCGTCCATCAGGTACAGGTA 

CTCGGCGAGGTTGTAGCGGCCGACGAGCTCGAACGCGGGCTGGCCGAGCT 

CGCGCCCGAGCGCGATGGTGCGCTCGAAATCGGCGATCATCCCGGCGCGA 

TCGCCCTGGAGCGCCCGCGCGAGCCCGCGGTTGTTGAGCGCGGCGCCGAG 

GTGCATGAGATCGCTGCGCTCCTCGCAGCTGAGGATCACCGCGTCGAGGT 

CTCGCGCCGCCTCCTCGACGCGGCCGAGGCTGGCCAGGATGAAGCCGAGC 

AGCAGCAGGGCGATGATGTGCGTCTCGTGGCCCTCGTCCCCGAGCCGCGC 

CGCCTGCGCCGCGGCGCGCGTCAGCACCGCGGCGGCCTCGTCCTCGCGGT 

CGGCGCGGTGGAGCGAGCGGCCCACGCCGAGGAGCAGGCGGGCGCCGAGC 

AGGGGCGAGGCCACCCGGCCGGCGAGGCGCTCG<3CGGCCGCGACCCGCTC 

GCGCGCGGCCCGGTACTCGCCCGTCCAGTCGAGGATCATGGCCTCGTCGA 

GGAGGAGCTCGATCTCGGCCCCCGCCTCCGACGCCGCCGCCGCCGCCTCG 

CGCGCCGCGGCGAGGTCGGCGAGGGCCTCGGTGTGGCGCCCGAGCCGGAA 

GCGAGCGAGGCCCCGCGCTCGGCGCTCCTCGGGGAGCAGCGCGCCGAGCA 

GCGCCTCGACGCGCCCGTAGCAGCCCTCGGCGTCGAGGTAGGCCCGGCGC 

GCGGCCGCGAGCTCGGCGCCGCGGGCGAGGAGCGACGCCGCGCGGGCGGT 

CAGGCCGCCGCGCTCGCAGTGCGCCGCGAGCACCAGCGGATCGGCCTCGC 
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CCGCGGCCTCGACCCAGTCGGCGGCGAGGCGGTGGCCGAGCGCGCGATCG 
TCCTTGGTGAGCTGCGCGTAAGCGCCCTCGCGCAGGAGCGCCTGGCGGAA 
GGAGTACTCCTCCTCGCCGGGGAAGCGGCCCTCGCGGTGGCGGACGCAGA 
GCTCCCCGGCGACGAGCGCGGAGAGGTGCTCCGCGAGCGGAGCGGCCTCG 
TCGCCCCCGAGCAGGTGCGCGACGGCGCCTCGCCAGAACACCTCGCCGAG 
CACGCTGGCGGCCCGCAGGATCCGGCGCGCGGGGGGCGCGAGCGCCTCCA 
GCCGGACCTGCACCATCGCCACCACCGTCTCGGGCAGCGCGTCGCCGCGG 
CCCTCCGCCGTCGCGCGGATCAGCTCCTCGAGGAAGAACGGCTGGCCCTC 
GGACTGGGTGACCAGACGATCGATGAGGGCCCCGTCGGCCGCGTCGCCCA 
GCGCCTCCCGCGCGAGCTGCGCGCACGCCCTCGGCGGGAGCTGCCTGAGC 
CAGAGCTCCTGCCGCCCGCGCTCGGCCCAGAGATCGGGGTACGCTTGC, 

or their complementary strands, 

(b) DNA-sequences which hybridise under stringent conditions 
to regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences , 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code , 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer isofunctional expression products. 

13. Peptide encoded by a DNA sequence according to claim 12 
selected from the group consisting of 
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Seq ID No 92 

>Contigll_002 591 amino acids MW=63639 D p3>5.80 numambig=0 

MLDWSTSDQVACRLHCAGAGPSASLELRYDASAGARRDAERIAERLAALLEDLSRHPER 

PVAQGEYVGPGERAEIEAWSRGPAMELPSACALHRWFEERAEQHPDWAVRSEGKSLTYG 

ELERRANRLASCLRRRGVGLDTIVGVCVPRSEDMWATLAVLKVGGAYLPLDHEYPGERL 

AFMMRDARARLLVTHDAIADELPTGGWTTLLLDAEAAEIAACSDARPAVSPPPDSGAYVI 

YTSGSTGTPKGSLISHRAIVNQMQWIQRYWALTADDRVLLKAAFGFDVSVWEIFWPLSFG 

ARIWARAGGHRDPEYLRRLVRDEGATTAYFVSSMIiAAFLiGGPEQPFPASLRKVLVGGEA 

VPLDL VRRFY A KHDGDLI NM YG P S E AA I AVTGC VL P SDPRVTV7VP LG A PV AN AE VFVLDG 

AMRRPAIGALGDLYIAGAPLARGYVGQPGLTAERFLPDPCARAAGGRMYRTGDVARFLPD 

GMLEFQGRSDHQIKLRGHRIELGDVEAQIRRVPGVGQAAWLREDAPGDARLVAYWLDG 

DAAGDAPDVRAGLKASLSAYMI PS SWRXiYALPMCSERLAFTGSSYAGCLL* 

Seq ID No 93 

>Contigll_007 361 amino acids MW=38862 D pl=10.42 numambig=0 
MSDHEMTGFSLSPQQRAIRALDREAGAPGCRTLAWAVTGPCDEGRLSAAALALAERHEI 
LRTRLVEAGRARPRRWSASRASRGRQQDDWVGCSEAEQGERMSRLVARLSEDRGADDGLR 
VGLVRVGPEERRLVLAAPAWCVDEES I APLVRELCASTAGAGAP PEQQYADVAEWLNGML 
ESEDAGDGRRFWAERRSHFGPPLKLAFSRGGAGAGAGSGRARVDL.GGGGMAQVERWSSSW 
QVPQRI VLLALWASLLWRMSGGNEF E VTVAVRFDGRSLDALAGAVGPFARFLPVRI E I SA 
SDTIJ^VARRLAlAEAEAAAHQDAAPGVSHRMSWGLLRRGGRAGAVARRRAGPRARRLEfi 

V* 

Seq ID No 94 

>Contigll_012 882 amino acids MW=95015 D pl=12.69 numambig=0 
MARALYAQEAAFRSAFNACAAAARARGLDLGEWFREGEGDGRTLLGTALAQPALFAVEL 
ALARLWMSWGIEPAAMIGKSLGELVAACVAGVFTLEDAMSLVIDRGRFMQAAPAGSMLAV 
GLPAADVEGLLEAGLEIAAWSPKITWAGPASAIRDLAARLEAREVFARPI^TSHAFHC 
ALI DGAVAPFLESVRRARLS PPE I ? WSNVTGALLTDAEATDPAYWARHLRQPVRFSDGV 
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EALFASGHALFLEVGPGRGLTTLTOQTLAGRGGAAIASLGSTHAASSPASLAEALGQLWE 
AGHAVDWTARPRGRPFARSSACRRTRSSGRGTGSRRAAAPPPPSRRRPTRPSRPRRTPSR 
RRRSPARTPGPRSPRRRPSPSARSPAIWERPPRRPRDRRPRRLLRPARRLARRDPGDVPD 
PRAARRRAPAAGALPGADGRGARGPGRRRAPRRGAAARVPPDRADPPRRPAPAWRAAPA 
WRRSARAGQPGVQHPVRGARDGPARRRRAPPQPWDRAAPRGAADDVQRQGRRAVPGRR 
ARGPGAVPDVRPRAPRGRGPRRGGLGARPRGEPRAVRPEPRAAPPRAGHPEAPRRARHRA 
RRAPRRVRRLVGRRLRRRARRALRRLRAGPAFPAAGAPGAVRRLRRSPAGVAVRRGPGGR 
APVIaDDEALGRAAPGPRAGRPRAGGAKDVARGAAEPRRRGRAHPPDQGVLRAGGDLPLHG 
PARRVQARPAPAHRARGPRRGHRRREPEPRRDRADDRLLRQPARAPDRLRGRPDVRRAGP 
PRARRRARGVRAPGPAVRPARRGPAAEGGRRARAPLRREVRHAQRARPPDEARGARARGA 
GGRGDHDGVRLRPHGRRGRRLVPLRRRAQQRAVFGGDGRQFP* 

Seq ID No 95 

>Contigll_021 1213 amino acids MW=131017 D pl=12.40 numambig-0 

MRGRRRRAAPHLRGARPARRGDRRGGLALRSDRERARRRGHGEGLGAGGRRARRAARRRR 

LPAPGPS PPRGAPQAP PGARRGP PRPHAVGGRRDDRLARGHRAPRRRRGRAVARAARRAP 

AARRKHGRPGLRDLHVRVHGAAQGRDDRPPRRGEHGPRHQPPLRRRPGGPGARALVAELR 

PVGLRRVRDARRRRRRRDPRPYPRLGSGALARARGARAGDRVELGPGADGDAHGRVPRRG 

RPGAWAPPRHDERRLDPAEAPRSHPRGLPRAPRREPRRRDRGVDLVDRPPDRGRRPGVA 

QHPLRPPAREPAHLRARRGAGAVSDRGPRRDPHRRDRRRARILARRGADPGAVPEAPHDR 

RAAVQDRRPGPLLRRRDHRAARAHRPSGEDPRLPHRARRDRGRPRAAPLGRAGGRGGEDR 

SVGREAFGRVRRRRRRRWRRAPRLRPEEAAGVHDPRGGRRPPGAAAERERQGGPRRPAGS 

RRGRAARRGRRPADRDRAAHRLGARGGLAGRGRRRDRQPIjRARLHIiAAPRARAAPARRAH 

RGARARRGRRGAGRVADRPVPVPDHRATGAAARRGDGQGRAGRRRGAARRGAPRRATPPG 

ARIAMSEPIETEDGGSDIAIVGMAGRFPGAPSVDALWENVRRGVESIARFPESEREEPPV 

GAS AAPGAP WCAGGLLDD I DRFD AS Y FGY S PREAQLMD PQQRLFLECAVAALEDAGCD P 

ARFPGAIGVFGGCGSNTYLLQLLSHPDLAATVDPHALMIASEKDYLATRVSYKLDLHGPS 

VWQTACSTSLVAVHMACESLLGGQCDLALAGGVS IG I PQKRGYPYVPGS I CSPDGRCRP 
FDARAEGTVGGSGVGIVALKRUOALRDRNTVHAVIRGSAVNNDGGRKVGFMAPSVDGQA 
AAISEAQSVAGVDPGSIGYVEAHGTATAIGDPIEVEALTQAFRRKTPRKAYCALGSIKAN 
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IGHLDAAAGVAGLIKAAHWRSGEIPPCVHFEAPNPKLDLAASPFFVPREAAPWPRELRP 
RRAGVSSFGIGGTNAHVVLEEPPPLPPRAPAPSRDHVLTLSARTPEALSTACAQLAAHLE 
ATDVPLDDVAFTLQTGRAEHPYRRAWARTRAEAIQGLAREGASALARPDEPRPSSRSRA 
RARRPSGWPARSTRRRRRSGAPSTRARRRRGRAASISARSSSARARATGARCSAPRWRSP 
RSSPSSSRSPGSG* 

Seq ID No 96 

>Contigll_026 3079 amino acids MW=332984 D pl=5.97 numambig=0 

MLTWDHHWVEYWSFALIVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGG 

AEYWRKALDGATTAIDLPRDRARHDAGARRGRAHAITIjPKPLTGALARIiARERGTTLFSV 

LLS ALT VLLHRASGQ SDL WGVP S AG RHDDE S ARAFGY FVQMLP VRVALRGAAS FDALVA 

RVRDAFLDALAHGDSALRKLLARAQGEAQRDALFDVAFAFQSTPPSLDARLSALAIGVGD 

VRIAQGELELTTLADEQAAAEFDLALFAAELDAGIALRFEYDQQLFDPATIERMARHFW 

LLESAVEHPGRPLSELRMLSDAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQ 

PDATALEFGHQRFTYAQLSTWSTELALWLRDRGVGPGSWGVC IERS PRMVAAQLAVLKA 

GAAYASLDPANPPARLAEMLMCRASLALTSSQASHKLTAAPCPVHLVQDGACAPSTHIP 

LV3RPDDLAYVLFTSGSTGTPKGVCVRHASLSRLVSFLHLRLDLSPSDRWTQVASSGFDA 

SVYEIWTP^CGAALLLADDDALRSPTALVSWLVAQRATLSFMPTPLAEACFEQDWTGSA 

LRAMTVGGDKLHPLRRPLPFRLFNMYGPTEATVITTVAEIADLGAEPPLGRPVDSALVYV 

LDPHMQPVPPGALGELYIGGACLAQGYTRTDLTAERFLPDPFGQPGARLYRTGDLVRWRP 

DGQLAFAGRRDEQVKLRGRRVELGEVE SVLRRLPGVREGI WLHGQGS AAHL I AH WPDA 

HPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGE 

LELELAHIWQSVLHLDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIRTTLRTLFEHPTL 

AQLASHLSSGAASTSAAAATALERGLTRPDGPSSPRVATPEEPFALTEGQRAMWLECQKS 

ADGALYNLGRTVRLGAGVDVAALRRAFEGLVERHEALRTTFLTRDGHPLQQVHRHVALEW 

AEEPAMALDEREIVARADEVRRRAFDLERGPLLRVHVWRRGEGQPPLLTVVVHHLVVDYW 

SFALLVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRKALDGATTA 

ICLPRDRARHDASPRRGRAHAITLPKPLTGALARLARERGTTLFSVLLSALTVLLHRASG 

QKDLWGVPSAGRNDDESTRAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDGLAHGD 

S AIjQHLLAE PRGAARRGG A L FDVAF AFQGAL P S LD PRLAALTTGAED VR I AQGELELTTL 
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ADEQAAAEFDIALFAAELDSGIALRFEYDQQLFDPATIERMARHFVLLLESAVEHPGRPL • 
SELRMLSDAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQPDATALEFGHQRF 
TYAELSTWSTELALWLRDRGVGPGSWGVCIERSPRMVAAQLAVLKAGAAYASLDPANPP 
ARLAEMLADCRAALVLTSSQASHKLTAAPCPVHLVQDGACAPSTHIPLVSRPDDLAYVLF 
TS GS SGTPKGVC VRHASLSRLVS FFQHLIiALS PRDRWTQIAS SGFDAS VYE I WTPLACGA 
ALLLADDDALRS PTALVSWLVAQRATLSFMPTPLAEACFEQDWTG I ALRAMTVGGDKLHP 

LRR PLPFRLFNMYGPTEATVI TTVAEVADLGDE P PLGRP I DS ALVYVLDPHMQPVP PGVL 
GELYIGGACLAQGYTRTDLTAERFljPDPFGQPGARLYRTGDLVRWRPDGQLAFAGRRDEQ 
VKLRGRRVELGEVESAiRRLPAVREGVVVLHGCKSSAARIilAYWPGADPPSERDLREGMA 
RLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGELERELAHIWQSVL 
HLDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIRTTLRTLFEHPTLHQLADRLSSGAAS 
TTAAAATVPAS E I APSLGRAPADE PYPLS YEQERL WVLEQLLPGGTAYNWOAVRLRNLV 

DVDALS SALAALVRRHWSLRTVFVAS PTPAQKI CEPEAAPAEWDLRGTPPDEAEAAARA 
WASREQATGFDLARGPVFRARLFRLDHDVCVLVLSTHHIVTDAWSFQPLVRDLAELYRRA 
RGGGPADMPELPLQYVDFAWQRRHLAGKRLADKLAHWTATLRGLPVLELQTDRPRPPVQ 
TF RGAERVLPLDARLVAQLDELARS RGATRFMVLLAALGVLLRRS SGQDD1A IGTAVANR 
PRPELEPLVGFFVNTIVMRLDLGGDPTFEELLSRARKVALEAFEHQDAPFEKWEAVNPR 
RDLSRSPLFQVMLWQNAPTEALELGEVRIEPLDLPVEATRFDLRFSVEPRGGRDVISLQ 
YNVDLFDAAT I DRMLATMQSVLSRATQDPAQRVRALSVAPEDRERALVAWNDTAVATPDH 
LRLEE PFFE RAVEQPDACAWDAERRLTYGELARRAE A I AAAAS RS GATANALVAWME K 
GWEQVAAVLGVLRAGAAYLPLDPRLPEERLRHLLEHAEVRLVLTQSAVDGTIAWPAGIER 

LAVDADERWREQ P VARRPPGGS TDDLAYV I YTSGSTGLPKGVM I DHRGAVNTVLDI NRRF 
"dVGPEDRVLALSSLSFDLS^DVFGTIiAAGGAWITDRTRASDPGHWRELVERERVTV^ 

SVPALMEMLMDAS PGAGDPALSSLRLVMMSGDW I PLKLPDR IRAACRAPRWSLGGATEA 
S I WS I AHP I ADVDPAWRS I PYGRPLANQHTYVLDEGLEPCP IGVPGE I H I GGI GVALGYW 

. RDE ARTRERFLKH PTTGERLYRTGDLGRYFADGT I ELLGRTDHQVKI RGFRI ELGE IEAA 
LAQHP S VEQAWAAKTDP SGEKRL VAYWGADGDGAALRD F VRKKL PE YM I PAEWVL PA 
LPLSANGKVDRAALPDPAAVAPRAAAVAPRTATERLIASVLAEVLQVEAVGVTDNLFELG 
FT3LLLVRAQRLLAERIAARAPDEGAAAQAVSLTDLFQYPTIEQLAQRLDAATVKAEPAD 

VGAQRAEARRDARRRRGRG * 
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Seq ID No 97 

>Contigll_011 544 amino acids MW=60164 D pl=9.10 numambig«0 

MMSRIRAQLGVELPLRALFQGPTVAALAAQVDAARRGEARRREFPPIARIPRDGPLPLSF 

AQHRL WFVDQLE PGS PAYN I P FVVRATGRLDVDALRRS LFE I ARRHEALRTT FS ARDGVP 

FPWAPEARVPFRMSDLEHLAGEALDAAVSALVLEESLAPFDLSRGPLLRVRVIRKRHDE 

HVIALWHHVVFDVWSVGVFVGELiAALYGGFAQGQPSRLPELPAQYVDFAAAQRAWLSGE 

VLEGELRYWTTKLSGALRRARVPVDHEPAGRRTWRGARRSLDAGAELTRQIKAFCEREAI 

SPFMALLAAYKLVLHQRTGLEDLWGTDVANRNRVETEPMIGFFVNQLVLRTDCGGDPTF 

GALVRRVRDVALEAFEHQDLPFDRLVEALRPKGAVGHVPLFDAKFVMRNVHVPPMKLEGL 

ELEALEGEATTTAFDF VLTVAEAGG S FRFGVEHSS EL YRAAT VDNFLS D YRQ I LATATAR 

PDTPVSELRGELERAAAARRELERKAARGAALDKLTSARRRAVTLPRPGAPGEAKTSPKD 

DLDE* 

Seq ID No 98 

>Contigl2_001 514 amino acids MW=56145 D pl«8.82 numambig^O 

PPAVRRYVADRRPEQLPALAPEEREAAARRLSALGAAPPQVRRRGLTRAPLSYGQSRIYF 

LEQLSPGKPLFNVPGAVRLRGPVDVARLSAAFGEIVRRHDALRTSIANVDGELLQIAQPH 

AGFALDVVTSTPEEAAELDRRLRAEAWRPFAIGAPPLLRATLFRLAEDEHVLLVTMHHVV 

SDDWSLGVILRELLALYAGRSLPPPRLQVSDFAAWQREMVESGALDGQRAYWRERLRGLS 

RAS ISAGGGAEAPSHDPSGAIEEI ALSPDKAAALEALARREGATLFMVLLALLDLVI HAR 

SGALDIAVGTPIANRNRPELEDWGLLTNTLVIRVDIiARAGAFRDVIiARARVQALDAFAN 

QDI PFDWTQDLKQERDHAQHPLFRVWLALQNAPKPALEVRGLRVEPLPLRPELVHFEVA 

LLLW PADDGS WGHFEFRRDRVDEGARKE I AAAFTHLVDAVI ARPDAPVSTLVEGARAEA 

ARAQAALGEAFARAATARLGQLRRRSAGDRTPRE* 

Seq ID No 99 

>Contigl2_009 582 amino acids MW-65555 D pl=8.72 numambig^O 
MREPSSTPNWRNFGSNLPAGSDSVP PGEGFP I KKI LALNLGKWKDTAGLQ I AQALHLFE Y 
GYKRYREGKFVLRATSDLGLGAIFESIDNWESFDQFEEFFKPWTFIRKPLVATRWAEDAE 
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FGRQRLVGINPAHIRRATPADLADFV3GAEPKPIAIADGRTLEEVREGGQLYFLDYRIFK 

DIVDTDVQEELGKYPLAPTCMLHQTAAGELLPVAIRLVHSRPGKGAHPDKIFTPSGPSDD 

WLTAKIAVASADAIYQGQVTHLLYAHLIVSPFAVSTYRNLPATHPLHQLLRPHFFNTLAI 

NELARRRFLGRGRFFDITSSVATMGSFELLTRAYTGKGIKGYGGKPWRFYESALPRDLSA 

RDVRDLVGYHYRDDALLHWDAIQEYVGQVIiKIAYPTPGSLSSDASLQRWIHELVSPQLGG 

MDSLLPFERADQLEKLTSLDDLIAIVTNIIFTATAYHAAVNFGQTDYYTWIPNAQFATYR 

SYGDVLNGSEKRQFKPLERLPGRAQSIRQMVLSRSLSMGPPLTSESLMTMKCLLQDPAAK 

QAF AR YRERLAH I ERE I TERNRAREQP YL YLLPSMVPQS VAI * 

SEQ ID No 100 (>ORFl) 

VSQRTSCYLRGGGVCSMNDAFLALERNERNRPSTVIDLLRQRAEAEPARPIYCFLESGDVEAG 

ATWVTLREIDERARTVAALLQASGVAPGARALLLYPPGIEYITAFFGCLYAGVRTVPAYPPDL 

GRLERTLPRVASIVADARAEAALTSSAVAGIVASLPASAAAAALQRLRWIATDGPSPGPIEGP 

GAALRPESVAFLQYTSGSTGEPKGVMLTHGNLLHNSRLIAHGFDLTSPDPVGVIWLPPYHDMG 

LIGGILQALYRRIRVALMSPLSFLQRPMRWLRAVSALGASVSGGPNFAYDLCVRKSSEEERAA 

1JDLRSWEVAFTGAEPVRADTLDRFARAFAVSGFRREAFYPCYGLAEATLTVSGGARAEAPVLA 

RLAPEEVELGRAVASAAEGARVFVGSGRALDPRAVAIVDPAGNELGPGEIGEIWVSGPSVAVG 

YWGRPEETEATFGATLAGSAAPRYLRTGDLGFLRGGELFWGRSKDLIILRGRNHFPQDIEKT 

VESSHRAVRPGCSAAFSVEHEGEERLAWCEVDPRVAADPREIVAAREAVTAEHQLVAHAVAL 

IAPGALPKTSSGKVRRRECRRAFLEDALGERHVAFAPELLDDASPPDDAPPETEEPSGRSLLD 

ALRSTLARALRLDAGQIDDALP I S RFGLD SLAAVELQHAFQ VRTGRAI PLTS I LRGGSLRLTR 

E I TRLDGP S S PRVATPGGAVCADRWGTGRFGS S A I SRPMERFTTWAGRS GS VP AFKRVDLRRA 

F 

SEQ ID No 101 (>ORF2) 

VYSSAYVLFAVCAGTTRVASAPETAGFPLECVGDDGTVLGPDSFWGYTQVYVFKKERLNTNP 
PIDGFTLKLDGNEVAPGEDGLPWKRCVRSEEQAQGCGRTEPAEDECTTYEIEAWPEKAAEV 
DEEAAGLGGPPAREAIWVDYYTDGGEFDGARRLVSDTTGASRGGNGTTWTPPSEPGRVSLWAV 

VHDTRGGASVTRREVQVE 
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SEQ ID No 102 (>ORF3) 

WGTVLSAGTGEPLPDIAVTLVRPDGGREEAKTDQGGKFRFKNLPPGKYRVEVAAAGFEPFAA 
EEEIAAGEATEVRYRISIAAPQDGKAPGIEVTVQGERPPREVTRRTIERREIDRIPGTGGDAL 
RSLQSLPGVARSGFGLLIVRGSAPQDTLTFVDRTPVPIIYHFGGLSSWPTEMLEKIDFYPGN 

F SAVYGRAMGGI VDVGLRS PKQDGKYHGVVQLDLI DGRVLLEGPVPFLKDWTF I AAGRRSWVD 
AWLGPVLKEAGSSVTQAPVYYDYQFVLEGRPSASERVRASFYGSDDAFKITLDKPPEDEPALT 

GDFGLHTAFQRFQLS YENRI GSRDRLLWSMALGRD I ADFE I S P LAFNWSTS LDLRLELSHRF 
ARYLTMbA^GTDLSGGVATVNIRAPSQQPAGHPSNQPFSTYPFQDRSFDGAYSRPAAYAELEW 
PSPRARIVPGVRVDYALDTQTLDVSPRVNARYDIRSGFPRTTAKGGVGLYYQAPQFAESIEPF 
GNAELKSNRAVHYGLGVEQEITPQIEVTLDGFYKQLDRLWFSPEKDDYADGTGYAVGGELLL 
KYKPDERFFGWAAYTLSRSVRKDGPDEEEHLTQFDQTHVLTVLGSLRLGRGWELARFRLVSGN 
LQTPYVCDPEEKGCNPNRVNAIYHASSARYSPIPLGGDYSERMPLFHQLDIRADKTWKFKRWQ 

LGLYLD I QNVYNYMAAEGI S YNFNYTKRE YVTGLP FLPTLGLRGD F 
SEQ ID No 103 (>GRF4) 

VI AVDNNPEAVDAVKDKT S AAFVGDATVHKVLEG I GAQ YVETAI VT FGEHFE P S VLCVASLVR 
MGVRIIARAATDRQADILRAVGATRVIQLETEMGRRV^ 

HGPLVGQTLAGSKIRQRYRINVLGVRPHTNKRPGDKPRLSAPTPDYVIRDGDTLLLVGDSDDV 
SRFVAEVGG 



SEQ ID NO 104 (>ORF5) 

SGSSGGGSSAEGSRCQPSGGGPHWLLEGETVTFPVTCASGLALAGDAFEVGPLPEGAAYDPIA 
REVTFSPGLDQAAVYDIEIRVAQTSEVGRVKVGVADAFADPSNVPVVDPTRYPEEYGLPVLFL 

S P V PEDKEY AP ATV I YRGHT YAAEAELRGE S SLSYP KRS YTLKFP KDDKFNE P DEAGGFTDRR 
KWLITTFDDNSYVRQRLAYDLWNRLDPEHIQIKTYSAVLYLDGEYAGLYTVADHVDGYLMED 

HGYPQDGNLYKAVSHDANFALTDRSGDPKDTLHDGFEKKEGAPAEGEPEAFSDLEDLVSFVAE 

SDDATFAAE IGSRIDLRDYEDWWI FVTF I VANDSAGKNS YKYRDPAADGVFRYAPWDFNAS FG 
QSWETEREPASDRVDYRDVNLLFERLLEEPSIGDPLRARYDQVLRGALAEAEIHAIVDGYVER 

IDASARRDEARWGEAYRSYEGWSWRDDFTTYEEEIAYLK 

AW I SERWQHQDELY 
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SEQ ID No 105 (Contig 11 >ORFl) 

VLDVWSTSDQVACRLHCAGAGPSASLELRYDASAGARRDAERLAERLAALLEDLSRHPERPVA 
QGEVGPGERAEIEAWSRGPAMELPSACALHRWFEERAEQHPDWAVRSEGKSLTYGELERRAN 

RIASCLRRRGVGLDTIVGVCVPRSEDMWATIJW^ 

LLVTHDAIADELPTGGWTTLLLDAEAAEIAACSDARPAVSPPPDSGAYVIYTSGSTGTPKGSL 
ISHRAIVNQMQWIQRYWALTADDRVLLKAAFGFDVSVWEIFWPLSFGARIVYARAGGHRDPEY 
LRRLVRDEGATTAYFVSSMLAAFLGGPEQPFPAS LRKVLVGGEAVPLDLVRRF YAKHDGDLI N 
MYGPSEAAIAVTGCVLPSDPRVTWPLGAPVANAEVFVLDGAMRRPAIGALGDLYIAGAPLAR 
GYVGQPGLTAERFLPDPCARAAGGRMYRTGDVARFLPDGMLEFCGRSDHQIKLRGHRIELGDV 

EAQ I RRVPGVGQAA WLRED APGDARLVAY WLDGDAAGD APDVRAGLKASL S A YM I P SSWR 
LYALPMCSERLAFTGSSYAGCLL 

SEQ ID No 106 (Contig 11 >ORF2) 

MSDHEMTGFSLS PQQRAI RALDREAGAPGCRTLAWAVTGPCDEGRLSAAALALAERHE I LRT 
RLVEGRARPRRWSASRASRGRQQDDWVGCSEAEQGERMSRLVARLSEDRGADDGLRVGLVRVG 
PEERRLVLAAPAWCVDEESIAPLVRELCASTAGAGAPPEQQYADVAEWLNGMLESEDAGDGRR 
FWAERRSHFGPPLHLAFSRGGAGAGAGSGRARVDLQGGMAQVERWSSSWQVPQRIVLLALWAS 
LLWRMS GGNE P E VTVA VRFDGRSLDALAGAVGPF ARFLP VR I E I S AS DTLADVARRLALAEAE 
AAAHQDAAPGVSHRMSWGLLRRGGRAGAVARRRAGPRARRLEHV 

SEQ ID No 107 (Contig 11 >ORF3) . 
MSRIRAQLGVELPIJIALFQGPTVAALAAQVDAARRGEARRREFPPIARIPRDGPLPLSFAQHR 

LWFVDQLEPGSPAYNIPFWRATGRLDVDALRRSLFEIARRHEALRTTFSARDGVPFPWAPE 

ARVPFRMSDLEHLAGEALDAAVSALVLEESLAPFDLSRGPLLRVRVIRKRHDEHVIALWHHV 

VFDWSVGVFVGEIiAALYGGFAQGQPSRLPELPAQYVDFAAAQRAWLSGEVLEGELRYWTTKL 

SGALRRARVPVDHEPAGRRTWRGARRSLDAGAELTRQIKAFCEREAI S PFMALLAAYKLVLHQ 
RTGLEDLWGTDVANRNRVETEPMIGFFVNQLVLRTDCGGDPTFGALVRRVRDVALEAFEHQD 
LPFDRLVEALRPKGAVGHVPLFDAKFVMRNVHVPPMKLEGLELEALEGEATTTAFDFVLTVAE 
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AGGSFRFGVEHSSELYRAATVDNFLSDYRQIIATATARPDTPVSELRGELERAAAARRELERK 
AARGAALDKLT S ARRRAVTL PRPGA PGE AKT S PKDDLDE 

SEQ ID No 108 (Contig 11 >ORF5) 

MSEPIETEDGGSDIAIVGMAGRFPGAPSVDALWENVRRGVESIARFPESEREEPPVGASAAPG 
APWCAGGLLDDI DRFDASYFGYS PREAQLMDPQQRLFLECAVAALEDAGCDPARFPGAIGVF 
GGCGSOTYLLQLLSHPDIJVATVDPHALMLASEKDY 

VHMACES LLGGQCDLALAGGVS I G I PQKRG YP YVPGS I CS PDGRCRPFDARAEGTVGGSG VG I 
VALKRLADALRDRNTVHAVI RGS AVNNDGGRKVGFMAPSVDGQAAAI SEAQSVAGVDPGSIGY 
VEAHGTATAI GDP I EVEALTQAFRRKT PRKAYCALGS I KANI GHLDAAAGVAGL I KAAHWRS 
GEIPPCVHFEAPNPKLDLAASPFr VPREAAPWPRELRPRRAGVSSFGIGGTN^ 
PPRAPAPERDHVLTLSARTPEALSTACAQLAAHLEATDVPLDDVAFTLQTGRAEHPYRRAVVA 
RTRAEAIQGIAREGASAIARPDEPRPSSRSRARARRPSGWPARSTRRRRSGAPSTRARRRRGR 

AAS I SARS SS ARARATGARCS APRWRS ?RS SP S SSRS PGSG 
SEQ ID No 109 (Contig 11 >ORFS) 

WDHHVWEYWSFALIVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRK 
ALDGTTAIDLPRDRARHDAGARRGRAHAITLPKPLTGALARIiARERGTTLFSVLLSALTVLI^ 
RASGQSDLWGVPSAGRHDDESARAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDALAHG 
DSALRHLLARAQGEAQRDALFDVAFAFQSTPPSLDARSALAIGVGDVRIAQGELELTTLADEQ 
AAAEFDLALFAAELDAGIALRFEYDQQLFDPATIERMARHFVVLLESAVEHPGRPLSELRMLS 
DAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQPDATALEFGHQRFTYAQLSTWST 

ELALWLRDRGVGPGSWGVC I ERS PRMVAAQLAVLKAGAAYASLDPANPPARLAEMLADCRAS 
LALTSSQASHKLTAAPCPVHLVQDGACAPSTHIPLVSRPDDLAYVLFTSGSTGTPKGVCVRHA 
SLSRLVSFLHLRLDLSPSDRWTQVASSGFDASVYEIVTTPLACGAALLLADDDALRSPTALVSW 
LVAQRATLSFMPTPXAEACFEQDWTGSALRAMTVGGDKLHPLRRPPFRLFNNFYGPTEATVITT 

VAEI ADLGAEP PLGRPVDSALVYVLDPHMQPVP PGALGELYI GGACLAQGYTRTDLTAERFLP 
DPFGQPGARLYRTGDLVRVmPDGQI^FAGRRDEQVKLRGRRVELGEVESVLRRLPGVREGIVV 
LHGQGSAAHLIAHWPDAHPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLP 
APPAAHAOYEPPSGELELEIAHIVIQSVLHLDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIR 
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TTLRTLFEHPTLAQLASHLSSGAASTSAAAATALERGLTRPDGPSSPRVATPEEPFALTEGQR 
AMWLECQKSADG 

ALYNLGRTVRLGAGVDVAALRRAFEGLVERHEALRTTFLTRDGHPLQQVHRHVALEWAEEPAM 

ALDEREIVARADEVRRRAFDLERGPLLRVHVWRRGEGQPPLLTVWHHLWDYWSFALLVREL 

GELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRKALDGATTAIDLPRDRARHDA. 

SPRRGRAHAITLPKPLTGALARLARERGTTLFSVLLSALTVLIjHRASGQNDLVVGVPSAGRND 

DESTRAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDGLAHGDSALQHLLAEPRGAARRGG 

ALFDVAFAFQGALPSLDPRLAALTTGAEDVRIAQGELELTTLADEQAAAEFDLALFAAELDSG 

IALRFEYDQQLFDPATIERMARHFVLLLESAVEHPGRPLSELRMLSDAERALLLDDWSGAAAA 

RQAASAPAPACVHALFEAHAARQPDATALEFGHQRFTYAELSTWSTELALWLRDRGVGPGSW 

GVCIERSPRMVAAQIAVLKAGAAYASLDPANPPARLAEMLADCRAALVIiTSSQASHKLTAAPC 

P VHL VQDG ACAP S TH I ? LVS RPDDL A YVLFT SGS S GTPKGVC VRHAS LS RL V S F FQH LLALS P 
RDRWTQIASSGFDASVYEIWTPlACGAALLIiADDDALRSPTALVSWLVAQRATLSFMPTPLAE 

* 

ACFEQDWTGIALRAMTVGGDKLHPLRRPLPFRLFNMYGPTEATVITTVAEVADLGDEPPLGRP 
IDSALVYVLDPHMQPVPPGVLGELYIGGACLAQGYTRTDLTAERFLPDPFGQPGARLYRTGDL 
VRWRPDGQLAFAGRRDEQVKLRGRRVELGEVESALRRLPAVREGVVVliHGQGSAARLIAYVVP 
GADPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGEL 
ERELAHIWQSVLHUDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIRTTLRTLFEHPTLHQLA 

DRLSSGAASTTAAAATVPASEIAPSLGRAPAD 

EPYPLSYEQERLWVLEQLLPGGTAYNWQATOLRNLVDVDALSSALAALVRRHWSLRTVFVAS 
FTPQKICEPEAAPAEWDLRGTPPDEAEAAARAWASREQATGFDLARGPVFRARLFRLDHDVC 
VLVLSTHHIVTDAWSFQPLTODlAELYRRARGGGPADMPELPLQYVDFAVWQRRHIiAGKRLiAD 

KLAHWTATLRGLPVLELQTDRPRPPVQTFRG^^ 

ALGVLLRRSSGQDDLiAIGTAVANRPRPELEPLVGFFVNTIVMRLDLGGDPTFEELLSRARKVA 

LEAFEHQDAPFEKWEAVNPRRDLSRS PLFQVMLWQNAPTEALELGEVR I EPLDL PVEATRF 
DLRFSVEPRGGRDVISLQYNVDLFDAATIDRMLATMQSVLSRATQDPAQRVRALSVAPEDRER 
ALVAWNDTAVATPDHLRLSEPFFERAVEQPDACAVVDAERRLTYGELARRAEAIAAAASRSGA 
TANALVAVVMEKGWEQVAAVLGVLRAGAAYLPLDPRLPEERLRHLLEHAEVRLVLTQSAVDGT 
IAWPAGIERLAVDADERWREQPVARRPPGGSTDDLAYVIYTSGSTGLPKGVMIDHRGAVNTVL 
DINRRFDVGPEDRVIiALSSLSFDLSVYDVFGTLAAGGAWIPDRTRASDPGHWRELVERERVT 
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VWNSVPALMEMLMDAS PGAGDPALS SLRLVMMSGDW I PLKLPDRI RAACRAPRWS LGGATEA 
SIWSIAHPIADVDPAWRSIPYGRPLANQHTYVLDEGLEPCPIGVPGEIHIGGIGVALGYWRDE 

ARTRERFLKHPTTGERLYRTGDLGRYFADGTIELLGRTDHQVKIRGFRIELGEIEAALAQHPS 

VEQAWAAKTDPSGEKRLVAYWGADGDGAALRDFVRKKLPEYMIPAEVWLPALPLSANGKV 

DRAALPDPAAVAPRAAAVAPRTATERLIASVIiAEVLQVEAVGVTDNLFELGFTSLLLVRAQRL 

LAER I AARAPDEGAAAQAVSIiTDLFQYPT I EQLAQRLDAATVKAE PADVGAQRAEARRDARRR 
RGRG 



SEQ ID No 110 (Contig 12 >ORFl) 

PPAVRRYVADRRPEQLPALAPEEREAAARRLSALGAAPPQVRRRGLTRAPLSYGQSRIYFLEQ 
LSPGKPLFNVPGAVRLRGPVDVARLSAAFGEIVRRHDALRTSIANVDGELLQIAQPHAGFALD 
VVTSTPEEAAELDRRLRAEAWRPFAIGAPPLLRATLFRLAEDEHVLLVTMHHWSDDWSLGVI 

LRELLALYAGRSLPPPRLQVSDFAAWQREMVESGALDGQRAYWRERLRGLSRAS I SAGGGAEA 
PSHDPSGAIEEIALSPDKAAALEALARREGATLFMVLLALLDLVIHARSGALDIAVGTPIANR 

NRPELEDWGLLTNTLVI RVDLARAGAFRDVLARARVQALDAFANQD I PFDWTQDLKQERDH 
AQHPLFRVWLALQNAPKPALEVRGLRVEPLPLRPELVHFEVALLLWPADDGSWGHFEFRRDR 
VDEGARKEIAAAFTHLVDAVIARPDAPVSTLVEGARAEAARAQAALGEAFARAATARLGQLRR 

RSAGDRTPRE 

SEQ ID No 111 (Contig 12 >ORF2) 

MSIHIEEEGRADAAKAPPFDYLQALHSALAHENDPVKRKQIEAGMVFKWLREEPLPFLSQLRR 
EKPIFSIPAITLVTRYMDVVEVLNANDVFSVDNIAPKLVEIWGQNIliAMNDSPKYEHEKSLLR 
LAFPRADLPRYRQIVVDEANRLLAKVGVDKPFDLTGDYALRVPAGAMARYLGVGEIPTEKVVA 

WTHALFNEI FLNPTNDPTAVAAARAARQRALPMIDAI VAARKKQLAKS PPPEQP SVLDR YLVM 
QSVPETYESDEGIRDVILGLLMGCVDLSGGAIWALVELMKRPRVLRDALNVVNVEDDAAITG 
YVLEALRFRPPSTGVTSLCVRDYTVGRGTRHEEKVPAGALVMACSASAMHDHEHIDAPDQFRP 
GRLPSRNYLFWESGIHTCHGKYVAILHISLAIKQLLRAGVPSAIDPMPRVHGYPAPFRVRLAA 

AEG 
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SEQ ID No 112 (Contig 12 >ORF3) 

MREPSSTPNWRNFGSNLPAGSDSVPPGEGFPIKKILALNLGKWKDTAGLQIAQALHLFEYGYK 
RYREGKFVLRATSDLGLGAI FES IDNWES FDQFEEFFKPWTF I RKPLVATRWAEDAE FGRQRL 
VGINPAHI RRATPADLADFVSGAE PKP I AI ADGRTLEEVREGGQLYF LDYR I FKD I VDTDVQE 
ELGKYPLAPTCMLHQTAAGELLPVAIRLVHSRPGKGAHPDKIFTPSGPSDDWLTAKIAVASAD 

AI YQGQVTH LL YAHL I VE P FAVS TYRNL PATH PLHQLLRPHF FNTLAINELARRRFLGRGRF F 
DITSSVATMGSFELLTRAYTGKGIKGYGGKPWRFYESALPRDLSARDVRDLVGYHYRDDALLH 
WDAIQEYVGQVLKIAYPTPGSLSSDASLQRWIHELVSPQLGGMDSLLPPERADQLEKLTSLDD 
LI AI VTNI I FTATAYHAAVNFGQTD YYTW I PNAQFATYRSYGDVLNGSEKRQFKPLERLPGRA 
QSIRQMVLSRSLSMGPPLTSESLMTMKCLLQDPAAKQAFARYRERLAHIEREITERNRAREQP 

YLYLLPSMVPQSVAI 

SEQ ID No 113 (Contig 12 >ORF4) 

VSSSRSTGRVPRDRASPAGSCAPALVPGPPLSYASVMPPLDLHVALFGASGAGKTVLLAAFYR 
AQTQPSFQQEYAYKIQAVNKAQGNQLLGRFYRLEEGRFPDGSTRFDEYEFDFFPRDLPEPAVR 

I HWYDY PGRWWEDEPVDADEREAMRQGL I RLGMSQ VG I LLADGAKYRAEGTGY I RWLFEH FAD 
ECDRLRRASAATGDEVSFPREWILALSKADLCPPDYSARDFEREVCRDADDQLAKLCSVLRAE 
HAFGHRFMLLSSVAAPAGAQVDPRTSLGVRTLAPAILVSTVEGAVREAQAARKEKSAGETFFQ 
GLRDLVQFVDSLDDFLPKRYQIVSKI LRFISIKDFATTRLDRLKKMREDAIRKGDTFTAVLTA 
MVAALRDD EGARAYHQNQ 

SEQ ID No 114 (Contig 12 >ORF5) 

MPAPAPLVETSRLLWRTRGEHWDYEFICVPEIPALPAWLSTLEAMLADADAGAGELRYGLLEI 
DDRGQRAPRAYPYVAVRFLDPARRDWTGRQVQHFAAWFPPVPPEAVAELPEAVPADWHLRVLD 
GLAGTYGSGEVFGLPEATIRAWKRSHDESRAARAMAIVKATPPVSLGGGEAAPSRWTRVPTLK 
KKPPEPPAAAGLLSVGAVPSGQGRRFGCFAIGAMMLAAFCRLMLACGVRLLGA 

SEQ ID No 115 (Contig 12 >ORF6) 

VRFRSSLGPLLLAALGAALTVSAAFRSAEASVFDSASRWPEDADGHVRIPVCIDPTSSAEQRV 
DGAAGGLIHAPNPSLADVITRVRTALQGSWERWSSVRFTGWESCDSLLPATRMTYVGVRIHPD 
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APNQSDSXGVYNKGGSVQFKPWGADFNRCIKYNWQTARVEYSFDCVEQYAIHEMGHAIGFMHE 
WKHPLVPSACSQREPLPASDVASGWPSSRRYIWNPGFYDYDSIMTYWSGCSDQDGVRFGSET 
LDAVDIQAVATVYPPVGGAPDVCNPGWFAGKRWFCAAQPTVSVGNSCSSGWVECLPHCNPRPF 

QGEWWTCPTNPYAVTGQSCSARWELCGD 

SEQ ID NO 116 (Contig 12 >ORF7) 
VGESQGALVGGNALSTNALNLNALNLNALN 

AASCALSSTASFTDFSWTOSNGKRHDERYPGLLGVAPAWASGPLDDAGQRIVSSCVAARVNYYQ 
VPVLLSARSLRDPLKTLSSSQELIDYPDVEGAFWGNLFAAQPYINACYNSATVDNSRAYQRDC 
AAGHVTSGGQXVECGLIRIAGSCDRVCQKLNGAGQYYPSCVDRPGQSTATTKDVITTALP 

SEQ ID No 117 (Contig 12 >ORF8) 
VLAAHCERGGLTARAASLLARGAELAAARRAYLDAEGCYGRVEALI^ 

LGRHTEALADLAAAREAAAAASEAGAEIELLLDEAMILDWTGEYRAARERVAAAERLAGRVAS 
PLLGARLLLGVGRSLHRADREDEAAAVLTRAAAQAARLGDEGHETHirALLLLGFILASLGRV 
ESAARDLDAVILSCEERSDLMHLGAALNNRGLARALQGDRAGMIADFERTIALGRELGQPAFE 
LVGRYNl^EYLYLMDDIJ^AARPHARAVQAIAPRCGDRHAPVVVTLLIARLRLYQGDEAGARRI 
ALRLRAARDDAGCEALKPSEDVLCAMIELATRDDDRAAWAALEERSARCSVGQERIEVLEARA 
IJ^UjRRGRRADARAQLERALAAASTIPTVMGGRIjRRVr^AELTRATESDAPDIDLAAA 

ARAREKVEY 

SEQ ID No 118 (Contig 12 >ORF9) 

QAYPDXjWAERGRQELWLRQLP PRACAQLAREALGDAADGALI DRLVTQSEGQPF FLEELI RAT 
AEGRGDALPETWAMVQWLEA^PPARRILRAASVLGEVFWRGAVAHLLGGDEAAPLAEHLS 
ALVAGELCVRHREGRFPGEEEYSFRQALLREGAYAQLTKDDRALGHRLAADWLEAAGEADPLV 
-LAAHCERGGLTARAASLLARGAEIAAARRAYLDAEGCYGRVEALLGALLPEERRARGLARFRL 

GRHTEALADLAAAREAAAAASEAGAE I ELLLDEAMI LDWTGEYRAARER VAAAERLAGRVAS P 
LLGARLLLGVGRS LHRADREDEAAAVLTRAAAQAARLGDEGHETHI I ALLLLGF I LASLGRVE 
EAARDLDAVILSCEERSDLNIHLGAALNNRGLARALQGDRAGM 

VGRYNLAEYLYLMDDLAAARPHARAVQAIAPRCGDRHAPVWTLL I ARLRLYQGDEAGARRI A 
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LRLRA/^RDDAGCEALKPSEDVLCAMIELATRDDDRAAWAALEERSARCSVGQERIEVLEARAL 
AALRRGRRADARAQLERAIW^STIPTVMGGRLRRWYAELTRATESDAPDIDLAAAEATFTGA 

RAREKVEY 

14 . DNA sequence according to any of claims 1 to 5 wherein the 
DNA is selected from the group consisting of 

(a) the following DNA sequences: 

Seq ID No 119 (>Contigl7) 

TTACGTTACTCATCCTATCTCGGCACCCTGTGTCGGTGATGTCGCTCGCC 
TCGAGCGCGAGCGGGACGACGTCGGCGCCGCGCTCGGTGAGCGCCGCCGC 
GAGGGCGCTCGCGAGATCGCTGGCGACGCCGGCCGGGGCCACGACGAGCC 
ACGTCCCCGCGACGTCGCCGCGTGACGCGGCGCTCACGGGTCTCCATTCG 
ACGCGGTAGCGCCACGCGCCCACGGTGCTCTGCTCTCGGCGGCTCCGCCG 
CCACGCCGACAGGGCCGGCATGAGGCTCTCGAGGGCCGAGCGCCGCCCGC 
TGTCGGCGACGTGGAGCGCGTCCGAGAGCGCCGCGACGTCGCCGCGCTCG 
ATGGCTCGCCAGAACGCGGTCTCCTCGGCGGACGCTCCCGGCGCCGCGTC 
CTCATCGTCCGACGCGTCGCCTGCGTCGAGCCAGAACCGCTCGCGCTGGA 
ACGCGTACGTCGGCAACGTCACGCGGCGCGCCCCGAGCGGAGCGAAGAAC 
GCACCCCAGTCGATGGCGTGCCCGCGCGCGTGGAGCTCGCCTGCCGAGAG 
GAGGAAGCGCTCGAGGTCGCCTTCGTCGCGGCGGAGCGAGGACACCACGG 
TCGCATCGCCGTCGATCGACGAGAGCGTCTCGTCGAGCGCGACGGTGAGC 
ACGGGGTGAGGGCTGACCTCGACGAAGAAGCGGTGGCCGTCGTCGAGCAG 
GGCGCGCGTGGCGTGCTCGAAGCGGACGGTGTGGCGCAGGTTTCGGTACC 
. AGTGGGCGGCGCCGAGGGCCTCGCCATCAAGCCTCTCGCCCGTCACCGCG 
GAGTAGAGCGGCACGGTCGCCGGGCGCGGCGCGATGCCGTCGAGCGCCTC 
CAGCATCGTCCGCTCGATGGCCTCCACGTGGGCGGAGTGGGAGGCGTACT 
CGACGCGGACCTTGCGGGCGAACAGCTGCGCCCCGCTCAGCTCTGCGACG 
AGCTCGTCGATAGCGCCGGGGTCTCCGGAGACGAGGGCCGCGTGAGGGCT 
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25 



45 



Is. 



GTTGATCGCCGCTATCGCCAGGCGTTCGCCCAAGGGCGCAAGGCGCGCCT 
CGAGCTCGGCGGTGGTGAGCTCGACGGCGGACATGGCGCCGCGTCCCGCG 
AGCTTCGTAATGGCGCGCGAGCGGAGCGCGACGACCCTGGCGGCGTCTTC 
TAGCGAGAGCGCGCCCGCGACGTACGCGGCCGCGATCTCGCCCTGGCTGT 
5 GGCCGACGACCGCGTCGGGCGTGACTCCGGCGGCGCGCCAGGTGGCGGCG 
15 AGGGCGATCATGACGGCGAACAGCACGGGCTGCACCACGTCGACGCGCTC 

GAGCATGGGCGCGGCGTGCGCTTCGTCGCCGCCGAGCACGGCGAGGAGCG 
ACCAGTCGACGTGCGGCGCCAGGGGGCGCTCGCACGCCTCGATCTCGGCC 
CGAAAGGCGGGCGAGGAGGCGAGCAGAGCGCGCGCCATCGATGGCCACTG 

20 

1 0 CGAGCCCTGGCCGGGGAAGACGAAGGCGACCTTGCCCGGCGGGAGCGCCT 

■» 

CGCCCGCGACCGTTCCTGCCCCCGCGCGCCCCTCGGCGAGCGCCGCGAGC 
GCCGAGAGCAGCGCGGCGCGATCGTCTGCCACGACGGCGGCGCGACGCTC 
GAAATGCGACCGCGTGGTCGCGAGCGACGCCGCGACGTCGACGAGGGCGA 
CGTCCTCGTGCTCGGCGAGGTGCGCGTGGAGCTTGCCCGCCTGAGCGCGG 
AGCGCCGCGTCGCTCTTCGCCGAGAGGAGCACCGGCACCGGCGGCGCGAA 
GGGCGCGCGGGCGGGCTCCCCGGCCTGGTCGTCGCCGGCCGCCGCGCGCG 
30 GCGCTTCCTCGAGGACCACGTGCGCGTTGGTGCCGGAGATCCCGAACGAC 

GACACCGCCGCGCGCCGAGGAGACCCGCCTGGCTTCCACGGTACCTCCTC 
GGTCAAGAGGCGGATCGCGCCGGACGACCAATCGATGTGCTGCGACGGGC 
20 TCGCGGCGTGGAGCGTCCTCGGGAGGACGCCGCTCTGCAGCGCGAGCACC 

35 

ATCTTGATGACGCCGCCGATCCCCGCGGCGGCCTGCGTGTGCCCGAGGTT 
CGACTTTAC3GCTCCCGAGCCACAGCGGGCGCTCCTTCGCGTGCGCCGCGC 
CGTACGTCGCGAAGAGCGCGCGCGCCTCGATGGQATCGCCGAGCGTCGTG 
40 CCGGTTCCGTGCGCCTCGACGGCGTCGACGTCCGCGGGGGCGAGCCCCGC 

GCTCGCGAGCGCGTCCCGGATCACGCGCTCTTGCGCGGGGCCGTTCGGCG 
CCGTGAGCCCTTGGCTCTTGCCGTCCTGGTTGACGGCCGATCCGCGCACG 
ATCGCGAGCACGGGGTGCCCGTTCTTCCGGGCGTCCGACAGGCGCTCGAG 
GAGCACTATCCCAGCGCCTTCCGACCAGCCCGCGCCGTTCGCGTGCGACG 
AGAACGACTTGCACCGCCCGTCCGGCGCGCCCGCGTGCTGCGCGCTGAAC 
1 0 TCGCCGAAGATCCCGGGGGTCGCCATCACGGTCACGCCGCCGGCGAGCGC 

50 



55 
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35 



GAGCGAGCACTCGCCTCGACGGATGGCGTGGCAGGCGAGGTGGAGCGCGA 
CGAGCGACGAGCTGCACGCCGTGTCGACGCT 



Seq ID No 120 (>Contigl8) 
5 TTTTAGGANCCCCGACGTGCACGATCGGCTCGCCAACCTCGTGGCGCGCC 
15 GGGACTATTTTTACCAGCTCGCGTTGCGCGCCGCGGGGACCTACGTGCGG 

GGCCTCGTCCGCGCCCCGCACGACGGCGCGCGCCCCCCCGCGTTCGCGCC 
GCGTGGGGCGGCGCTCGTCACGGGCGGGACCGGGGCGCTCGGGGCGCACG 
TTGCCCGTTGGTTCGCGCGGATCGGCGCCGAGCACATCGTGCTCGCGAGC 

20 

10 CGCCGCGGAGCCGCGGCCCCCGGCGCGGCCGCGCTCGCCGAGGAGCTTTC 
GGTGCTCGGCGCGCGCGTGACGCTGGTTGCGTGCGACGTCCCCGATCGTG 
AGGCGGTCGCGGGGCTCGTGCGCAACGTCAAGGCCGGCGGAGCGACGGTG 
oc CGCGCCGTGTTCCACGCGGGCGC5TGCGATGCACGAGGCGCCGGTCGCCGC 

CATGCGTGTTGAGGAGCTCGCCGACGCGATCGCCGTGAAGGCCCGCGGCG 
1 5 CGCAGCACCTCCAAGACGTCTTCGCGCAGCGCCCGCTCAACGCGTTTGTC 
CTCTTCTCGTCAGAAACGGGTGTGTGGGGCGGTGGCCGGCAAGGCGCGTA 
30 CGCCGCGGCGAACGCGTTCCTCGACGCGCTCGCCGAGGCGCGTCGCGCGG 

ACGGCCTCGCGGCGACCTCGATCGCGTGGGGCGCGTGGGCGGGCGGCGGA 
ATGCTCGCGACCGACGCCGAGCGGCGCTTGAAGCATCGCGGCGTCGCGCC 
2 0 GATGGATCCGGAGCTCGCCGTCGCGGCCCTCGCGCACGCGCTCGATCACG 
CCGAGACGTGCCTCGCCGTCGCTGACGTCGACTGGGCGCGCTTCGCCCCG 
TCGTTCGCCTCGGCGCGTCCTCGCCCGCTCCTCGACGAGCTCGCGGAGGC 
GCGATCGGCGCTCGACGCGCTGCGCGAGCCACCGGACGACGCGCGCACGG 
40 CCGCCGGTCGCGAGCCCGCAAGCACGCTGAGGACCACGCTCGCGGCGCTC 
2 5 CCGGAGGGCGAGCGCCACCGCCACCTCCTCGCGCTCGTGCGGACGGAGAC 
GGCGGCGGTGCTCGGGCACGCGGACGCGTCGCGCGTCGAGCCGAACCGCG 
GGTTCTTTGACeTCGGGCTCGACTCGCTCATGTCCGTCGAGCTCCGCAGG 
45 CGCGTCCAGCGCGCGACCGGCATCAAGCTCCCGGCGACGCTCGCGTTCGA 

CCACCCGACGCCGAGCGCGCTCGCGAGCAAGGTGCTCGCCGCGATCGTCC 
30 TCCACGACGCGACCCCGCGCGCCTCGCCCGCCGCGGAGCTCGAGCGCCTC 

50 



55 
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GAGGGGATGCTCTCGGCGATCTACGCGGACGAAGCGCTCCGCGACGACCT 
CACGGCGCGCCTCCGCGCCTTCCTGGACAAGCGCGCGGTCCGCACCGAAC 

10 

GCCCCGACGACGCCGCGTTCGCCGAGAAGCTCGGCTCCGCGAGCGCCGAC 

GAACTCATTCGCCTGATCGATCAGAAGCTCGGAGATCGCATCGATGTCGA 

5 CCGTTACTAACGACACGCTCACGGAGTACTTGCGGCGCCTCACTCAAGAG 

15 CTCCACAGGAGCGAGACC-CGCCTGCGTGCGACGGAAGAGAGGCGACATGA 

GCCGATCGCCATCGTCGGCCTCGGGCTCCCCTTCCGGGGCGGGATCCACG 

ACCGCGACACGCTCTGGACGTTCCTCGAGGAGGGCCGCGACGCCATCGCG 

CCGATCCTCGCGAGCCGCTGGAACGCGGACGCGACGTACGACCTCGATCC 

10 GGACGCCGTCGGCAAGAGCTACGTGCGCGACGCCGCCATGCTCGATCGCG 

TCGACCTTTTCGACGCCGATTTCTTCGGGATCAGCCCGCGCGAGGCGAAG 

TACGTCGACCCGCAGCACCGCCTCTTGCTCGAGACGTCGTGGCAAGCGCT 

CGAGGACGCGGGGATTGTGCCGGCGTCGCTGCGAGACTCGAAGACCGGCG 
25 r 

TCTTCGTCGGCACGGGCGCGAGCGACTACGCGTTCCTCCAGAGCGATCGC 

1 5 GACGCCTCGGAGGCGTACGCGTTCATGGGGATGATCTCGTCGTTCGCGGC 

GGGCCGCCTCGCGTTCACGCTCGGGCTCCAAGGCCCCGCGCTATCGATCG 
30 ACACGGCGTGCTCTTCGTCGCTCGTCGCGCTCCACCTCGCGTGCCAGTCG 

CTGCGTCAAGGCGAGTGCGACCTCGCGCTCGTCGCGGGTGTGCAGGTCAT 
GTCGTCGCCGGAGGTGTTCGTGCTGCTCTCGCGCACGCGCGCGCTCGCGA 

2 0 GCGACGGGCGATCGAAGACGTTCTCGGCGAACGCCGACGGCTATGGCCGC 

35 

GGCGAAGGCGTCGTCGTCCTGGCCGTCGAGCGCCTCCGCGACGCGCGCGC 
GAAAGGGCGCCCGATCCTCGCGGTGATCCGCGGCAGCGCGGTGAACCACG 
ACGGCACGTCGAGCGGGATCACGGTCCCGAACGGGCCCGCGCAGCAGAAG 
40 GTGCTCCGCGCCGCGCTCGACGAC3CGCGGCTTGTCCCCGCCGACGTCGA 

2 5 CGTCGTCGAGTGCCACGGCACGGGGACCTCCATCGGCGATCCCATCGAAG 

TGAACGCGCTCGCCGCCGTCTACGGCGAGGGGCGCCCCAAGGACCGCCCG 
CTGTTCCTGGGCGCGCTGAAGACCAACATCGGGCACCTCGAGTTCGCGTC 
45 GGGCCTCGCCGGCGTCGCGAAGATGGTCGCCTCCATGCGCCACGCGACCC 

TCCCCGCGACGCTGCACACGAGCCCGCTCAACCCGCTCGTCGACTGGGAC 

3 0 GCGCTCCCCGTGCGCGTCGTCGACGCCGCGCGCCCGTGGACGCGCCGCGA 

50 
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10 



20 



CGACGGCGCCCCCCGGCGCGCCGGCGTCACGGCGATCGTCGAGGAGGCGC 
CCGCCGAGCCCGAGCCCACGACGCCCGACGCCGCGCCCGCGCTTCCGGCC 
GTGCCCGTTCTCCTCTCGGGCAAGACCGACGAGGCGCTGCGCGCGCAGGC 
AGCGCGCCTCCACGCGCACCTCGCGGGGCGCCCCGACGCGCGGCTCGTCG 
5 ACATCGCCGCGTCGCTCGCGACGACGCGCACGCACTTCGATCGACGCGCG 
15 GCCGTCGTCGCGGCGGATCGCGACGAGCTCCTCGGCGCGCTCGACGCGCT 

CGCGCGCGGCGAGGCAGGCCCGGGGTCGGTCGTCGCGAGCGCGATCCCCG 
CCGGCAGGGTCGTGTTCGTGTTCCCCGGCCAAGGCTCGCAGTGGGTCGGG 
ATGGCGCGCGCGCTCCTCGCGTCGTCGGTGGTCTTCCGCGACGAGATCGC 
1 0 GGCCTGCGAGCGCGCGCTCGCGCCGCACGTCGCCTGGTCGCTCGGCGCCG 
TTCTCCGGGGCGACGGCGACGAGGCGACGCTCCTCGGCCGCGTCGACGTC 
GTGCAGCCGGTCCTCTTCGCCGTCATGGTCGCCCTCGCCGCGCTCTGGCG 
CTCGATCGGCGTCACGCCCGACGCCGTCGTCGGGCACAGCCAAGGCGAGA 
TCGCCGCCGCCTACGTCGCCGGCGCCCTCTCGCTCGAAGACGCCGCCAAG 
1 5 GTCGTCGCGCTGCGCGCACGAGCGCTCACGAAGATCGCGGGGCGCGGGGC 
GATGGCCGCCGTCGAGCTCGGCGCACGCGACACCGAGGCGCGCCTCGCGC 
30 CGTTCGGCGACGCCATCGCGATCGCGGCGATCAACAGCCCGCGCGCCACG 

CTCGTCGCGGGCGACACGGACGCGATCGACGCGCTCGTCCGCGACCTCGA 
GGCCGCGCAGATCTTCGCGCGGAAGGTGCGTGTCGACTACGCGTCGCACT 
2 0 CGGCGCACGTCGAGGCGATCGAGCGCGAGCTCCTCGCGGATCTCGCGGGG 
ATCGAACCGCGCGCGGGCGCTGTGCCGCTTTACTCCGCGGTGACGGGCGC 
GAAGCTCGACGGGAACCGCCTCGACCCCGCGCATTGGTTCCGGAACCTGC 
GCTCGACAAAAAACTTTGAGGACGCCACGCGCGCGCTCCACGACGACGGC 
40 CGCCGGGTATCCTCATNATCNNGGGCGTNCAGAGGAGTCGGTATTNCCCC 

2 5 CCCCCGCCTTNCCCG , 
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(b) DNA-sequences which hybridise under stringent conditions 
to regions of DNA-sequences according to (a) encoding proteins 
or to fragments of said DNA-sequences, 

(c) DNA-sequences which hybridise to the DNA-sequences accord- 
ing to (a) and (b) because of a degeneration of the genetic 
code , 

(d) allele variations and mutants resulting by substitution, 
insertion or deletion of nucleotides or inversion of nucleotide 
segments of DNA-sequences according to (a) to (c) , wherein the 
variations and mutants offer i so functional expression products. 

15. Peptide encoded by a DNA sequence according to claim 14 
selected from the group consisting of 

Seq ID No 121 

>Contigl7_001 828 amino acids MW=86259 D pl=5.60 numambig=l 
MT VMAT PG I FGE F S AQHAGAPDGRCKS F S S HANGAGW S EGAG I VLLERLSDARKNGHPVL 
AIVRGSAVNQDGKSQGLTAPNGPAQERVIRDALASAGLAPADVDAVEAHGTGTTLGDPIE 

ARALFATYGAAHAKERPLWLGSLKSNLGHTQAAAGIGGVI KMVLALQSGVLPRTLHAAS P 
SQHIDWSSGAIRLLTEEVPWKPGGSPRRAAVSSFGISGTNAHWLEEAPRAAAGDDQAGE 
PARAPFAPPVPVLLSAKSDAALRAQAGKLHAHIAEHEDVALVDVAASLATTRSHFERRAA 
WADDRAALLSALAALAEGRAGAGTVAGEALPPGKVAFVF PGQGSQWPSMARALLAS S PA 
FRAEIEACERAIJIPHVDWSLLAVLGGDEAHAAPML 

. VTPDAVVGHSQGEI AAAYVAGALSLEDAARVVALRSRAI TKLAGRGAMSAVELTTAELEA 
RIJVPLGERLAIAAINSPi^LVSGDPGAIDELVAELSGAQLFARKVRVEYASHSAHVEAI 
ERTMLEALDGIAPRPATVPLYSAVTGERLDGEALGAAHWYRNLRHTVRFEHATRALLDDG 
HRFFVEVSPHPVLTVALDETLSSIDGDATWSSLRRDEGDLERFLLSAGELHARGHAIDW 
GAFFAPLGARR VTLPTYAFQRERFWLDAGDAS DDEDAAPGASAEETAFWRAI ERGDVAAL 
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SDALHVADSGRRSALESLMPALSAWRRSRREQSTVGAWRYRVEWRPVSAASRGDVAGTWL 
WAPAGVASDLASALiAAALTERGADVVPLALEASDITDTGCRDRMSNVX 

Seq ID N 7 o 122 

>Contigl8_002 502 amino acids MW=53019 D pl=6.83 numambig=l 

FRXPDVHDRLANLVARRDYFYQIJUjRAAGTYTOGIjVRAPHDGARPP 

GALGAHVARWFARIGAEHIVLASRRGAAAPGAAALAEELSVLGARVTLVACDVPDREAVA 

GLVRNVKAGGATVRAVFHAGGAMHEAPVAAmVEELADAIAVKARGAQHLQDVFAQRPLN 

AFVLFS S ETGVWGGGRQGAYAAANAF LDALAEARRADGLAATS I AWGAWAGGGMLATDAE 

RRLKHRGVAPMDPEIAVAALAHALDHAETCLAVADVDWARFAPSFASARPRPLLDELAEA 

RSALDALREPPDDARTAAGPEPA3TLRTTLAALPEGERHRHLLALVRTETAAVLGHADAS 

RVEPNRGFFDLGLDSLMSVELRRRVQRATGIKLPATLAFDHPTPSALASKVLAAIVLHDA 

TPRASPAAEIiERLEGMLSAIYADEALRDDLTARLRAFLDKRAVRTERPDDAAFAEKLGSA 

SADELI RLIDQKLGDRIDVDRY* 
Seq ID No 123 

>Contigl8_010 840 amino acids MW=88062 D pl=5.74 numambig=6 
MSTVTNDTLTEYLRRLTQELHRSETRLRATEERRHEPIAIVGLGLPFRGGIHDRDTLWTF 
LEEGRDAI AP I LASRWNADATYDLDFDAVGKS YVRDAAMLDRVDLFDADFFGI S PREAKY 
VDPQHRLLLETSWQALEDAGIVPASLRDSKTGVFVGTGASDYAFLQSDRDASEAYAFMGM 

I S S F AAGRLAFTLGLQG PALS I DTAC S S S LVALHLACQS LRQGE CDLALVAGVQVMS S P E 
VFVLLSRTRAIASDGRSKTFSANADGYGRGEGAAAnjAVERLRDARAKGRPILAVIRGSAV 
NHDGTSSGITVPNGPAQQKVLRAALDDARLVPADVDVVECHGTGTSIGDPIETVNALAAyY 
GEGRPKDRPLFLGALKTNIGHLEFASGLAGVAKMVASMRHATLPATLHTSPLNPLVDWDA 
LPVRWDAARPWTRRDDGAPRRAGVTAIVEEAPAEPEPTTPDAAPALPAVPVLLSGKTDE 

ALRAQAARLHAHLAGRPD ARLVD I AAS LATTRTH FDRRAA WAADRDELLGALDALARGE 
AGPGSWASAIPAGRWFVFPGQGSQWGMARALLASSVVFRDEIAACERALAPHVAWSL 

G AVLRGDGD EATLLGRVD WQP VL F A VMVALAAL WRS I GVT PD A W GHS QGE I AAA Y VAG 
AL S LEDAAKWALRARALTKI AGRGAMAAVELGARDTEARLAP FGDA I AI AAI NS PRATL 
VAGDTDAI DALVRDLEAAQ I FARKVRVDYASHSAHVEAI ERELLADLAGI EPRAGAVPLY 
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S AVTGAKLDGNRLDPAHWFRNLR3TKNFEDATRALHDDGRR VS S X SXAXRGVG I XPPRLX 
X 

16. Recombinant expression vector which comprises a DNA- 
sequence according to any of claims 1 to 10, 12 and 14. 

17. Procaryotic or eucaryotic cell which has been transfected 
or transformed with a DNA-sequence according to any of claims 1 
to 10, 12 and 14 or with a recombinant expression vector ac- 
cordina to claim 16. 

18. Cell according to claim 17, v/herein the cell is derived 
from myxobacteria . 

19. Cell according to claim 17, wherein the cell is derived 
from a Sorangium strain. 

20. Cell according to claim 17, wherein the cell is derived 
from Sorangium cellulosum. 

21. Cell according to claim 17, wherein the cell is derived 
from a Streptomyces strain. 

22. Cell according to claim 17, wherein the cell is derived 
from Escherichia coli. 

23. Process for an enzymatic biosynthesis, mutasynthesis or 
partial synthesis of polyketide or heteropolyketide compounds, 
wherein a cell according to any of claims 17 to 22 is culti- 
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vated in a suitable culture medium and the polyketide or het- 
eropolyketide compound is isolated from the medium. 

24. Process according to claim 23, wherein the polyketide or 
heteropolyketide compound is an epothilone. 
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